Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 10000 |
| Missing cells | 2485 |
| Missing cells (%) | 1.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.4 MiB |
| Average record size in memory | 144.0 B |
Variable types
| Text | 8 |
|---|---|
| Numeric | 7 |
| Categorical | 1 |
| DateTime | 1 |
Id is highly overall correlated with VoteCount | High correlation |
VoteCount is highly overall correlated with Id and 2 other fields | High correlation |
Budget is highly overall correlated with VoteCount and 1 other fields | High correlation |
Revenue is highly overall correlated with VoteCount and 1 other fields | High correlation |
OriginalLanguage is highly imbalanced (69.2%) | Imbalance |
TagLine has 2413 (24.1%) missing values | Missing |
Popularity is highly skewed (γ1 = 20.16937129) | Skewed |
Id has unique values | Unique |
VoteAverage has 261 (2.6%) zeros | Zeros |
VoteCount has 260 (2.6%) zeros | Zeros |
Budget has 4472 (44.7%) zeros | Zeros |
RunTime has 137 (1.4%) zeros | Zeros |
Revenue has 4155 (41.5%) zeros | Zeros |
Reproduction
| Analysis started | 2023-12-10 15:07:27.447617 |
|---|---|
| Analysis finished | 2023-12-10 15:07:38.593574 |
| Duration | 11.15 seconds |
| Software version | ydata-profiling vv4.5.1 |
| Download configuration | config.json |
GenreIds
Text
| Distinct | 2278 |
|---|---|
| Distinct (%) | 22.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.2 KiB |
Length
| Max length | 43 |
|---|---|
| Median length | 36 |
| Mean length | 11.8537 |
| Min length | 2 |
Characters and Unicode
| Total characters | 118537 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 5 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1416 ? |
|---|---|
| Unique (%) | 14.2% |
Sample
| 1st row | [28, 12, 53] |
|---|---|
| 2nd row | [28, 53, 80] |
| 3rd row | [16, 28, 14] |
| 4th row | [28, 53] |
| 5th row | [53, 18] |
| Value | Count | Frequency (%) |
| 18 | 3785 | |
| 35 | 3004 | |
| 28 | 2776 | |
| 53 | 2659 | |
| 12 | 1884 | 7.2% |
| 10749 | 1576 | 6.0% |
| 27 | 1556 | 5.9% |
| 14 | 1343 | 5.1% |
| 10751 | 1337 | 5.1% |
| 16 | 1318 | 5.0% |
| Other values (10) | 4943 |
Most occurring characters
| Value | Count | Frequency (%) |
| , | 16181 | |
| 16181 | ||
| 1 | 13312 | |
| 8 | 11217 | |
| [ | 10000 | |
| ] | 10000 | |
| 5 | 7303 | |
| 2 | 6744 | |
| 7 | 6569 | |
| 3 | 6233 | 5.3% |
| Other values (4) | 14797 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 66175 | |
| Other Punctuation | 16181 | 13.7% |
| Space Separator | 16181 | 13.7% |
| Open Punctuation | 10000 | 8.4% |
| Close Punctuation | 10000 | 8.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 13312 | |
| 8 | 11217 | |
| 5 | 7303 | |
| 2 | 6744 | |
| 7 | 6569 | |
| 3 | 6233 | |
| 0 | 5381 | |
| 4 | 4029 | 6.1% |
| 9 | 2771 | 4.2% |
| 6 | 2616 | 4.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 16181 |
Space Separator
| Value | Count | Frequency (%) |
| 16181 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 10000 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 10000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 118537 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| , | 16181 | |
| 16181 | ||
| 1 | 13312 | |
| 8 | 11217 | |
| [ | 10000 | |
| ] | 10000 | |
| 5 | 7303 | |
| 2 | 6744 | |
| 7 | 6569 | |
| 3 | 6233 | 5.3% |
| Other values (4) | 14797 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 118537 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| , | 16181 | |
| 16181 | ||
| 1 | 13312 | |
| 8 | 11217 | |
| [ | 10000 | |
| ] | 10000 | |
| 5 | 7303 | |
| 2 | 6744 | |
| 7 | 6569 | |
| 3 | 6233 | 5.3% |
| Other values (4) | 14797 |
Id
Real number (ℝ)
HIGH CORRELATION  UNIQUE 
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 306598.19 |
| Minimum | 5 |
|---|---|
| Maximum | 1191902 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 887.95 |
| Q1 | 11391.25 |
| median | 117257 |
| Q3 | 535427.75 |
| 95-th percentile | 1020601 |
| Maximum | 1191902 |
| Range | 1191897 |
| Interquartile range (IQR) | 524036.5 |
Descriptive statistics
| Standard deviation | 350362.44 |
|---|---|
| Coefficient of variation (CV) | 1.1427414 |
| Kurtosis | -0.38422622 |
| Mean | 306598.19 |
| Median Absolute Deviation (MAD) | 116438.5 |
| Skewness | 0.91953223 |
| Sum | 3.0659819 × 109 |
| Variance | 1.2275384 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 299054 | 1 | < 0.1% |
| 803279 | 1 | < 0.1% |
| 10407 | 1 | < 0.1% |
| 13341 | 1 | < 0.1% |
| 12689 | 1 | < 0.1% |
| 306745 | 1 | < 0.1% |
| 1175873 | 1 | < 0.1% |
| 24248 | 1 | < 0.1% |
| 901 | 1 | < 0.1% |
| 531 | 1 | < 0.1% |
| Other values (9990) | 9990 |
| Value | Count | Frequency (%) |
| 5 | 1 | |
| 11 | 1 | |
| 12 | 1 | |
| 13 | 1 | |
| 14 | 1 | |
| 15 | 1 | |
| 16 | 1 | |
| 18 | 1 | |
| 19 | 1 | |
| 22 | 1 |
| Value | Count | Frequency (%) |
| 1191902 | 1 | |
| 1191885 | 1 | |
| 1191557 | 1 | |
| 1191556 | 1 | |
| 1191268 | 1 | |
| 1191086 | 1 | |
| 1190610 | 1 | |
| 1190581 | 1 | |
| 1190531 | 1 | |
| 1190476 | 1 |
OriginalLanguage
Categorical
IMBALANCE 
| Distinct | 50 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.2 KiB |
| en | |
|---|---|
| ja | 602 |
| ko | 319 |
| fr | 308 |
| es | 296 |
| Other values (45) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 20000 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 15 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | en |
|---|---|
| 2nd row | en |
| 3rd row | en |
| 4th row | en |
| 5th row | es |
Common Values
| Value | Count | Frequency (%) |
| en | 7498 | |
| ja | 602 | 6.0% |
| ko | 319 | 3.2% |
| fr | 308 | 3.1% |
| es | 296 | 3.0% |
| zh | 162 | 1.6% |
| it | 152 | 1.5% |
| cn | 135 | 1.4% |
| de | 78 | 0.8% |
| ru | 65 | 0.7% |
| Other values (40) | 385 | 3.9% |
Length
| Value | Count | Frequency (%) |
| en | 7498 | |
| ja | 602 | 6.0% |
| ko | 319 | 3.2% |
| fr | 308 | 3.1% |
| es | 296 | 3.0% |
| zh | 162 | 1.6% |
| it | 152 | 1.5% |
| cn | 135 | 1.4% |
| de | 78 | 0.8% |
| ru | 65 | 0.7% |
| Other values (40) | 385 | 3.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 7890 | |
| n | 7692 | |
| a | 646 | 3.2% |
| j | 602 | 3.0% |
| r | 393 | 2.0% |
| o | 352 | 1.8% |
| s | 333 | 1.7% |
| k | 327 | 1.6% |
| f | 319 | 1.6% |
| t | 287 | 1.4% |
| Other values (14) | 1159 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20000 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 7890 | |
| n | 7692 | |
| a | 646 | 3.2% |
| j | 602 | 3.0% |
| r | 393 | 2.0% |
| o | 352 | 1.8% |
| s | 333 | 1.7% |
| k | 327 | 1.6% |
| f | 319 | 1.6% |
| t | 287 | 1.4% |
| Other values (14) | 1159 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 7890 | |
| n | 7692 | |
| a | 646 | 3.2% |
| j | 602 | 3.0% |
| r | 393 | 2.0% |
| o | 352 | 1.8% |
| s | 333 | 1.7% |
| k | 327 | 1.6% |
| f | 319 | 1.6% |
| t | 287 | 1.4% |
| Other values (14) | 1159 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 7890 | |
| n | 7692 | |
| a | 646 | 3.2% |
| j | 602 | 3.0% |
| r | 393 | 2.0% |
| o | 352 | 1.8% |
| s | 333 | 1.7% |
| k | 327 | 1.6% |
| f | 319 | 1.6% |
| t | 287 | 1.4% |
| Other values (14) | 1159 | 5.8% |
OriginalTitle
Text
| Distinct | 9713 |
|---|---|
| Distinct (%) | 97.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.2 KiB |
Length
| Max length | 104 |
|---|---|
| Median length | 61 |
| Mean length | 15.6792 |
| Min length | 1 |
Characters and Unicode
| Total characters | 156792 |
|---|---|
| Distinct characters | 2095 |
| Distinct categories | 21 ? |
| Distinct scripts | 17 ? |
| Distinct blocks | 24 ? |
Unique
| Unique | 9453 ? |
|---|---|
| Unique (%) | 94.5% |
Sample
| 1st row | Expend4bles |
|---|---|
| 2nd row | The Equalizer 3 |
| 3rd row | Mortal Kombat Legends: Cage Match |
| 4th row | Mission: Impossible - Dead Reckoning Part One |
| 5th row | Nowhere |
| Value | Count | Frequency (%) |
| the | 2557 | 9.2% |
| of | 730 | 2.6% |
| a | 328 | 1.2% |
| 2 | 273 | 1.0% |
| and | 229 | 0.8% |
| in | 226 | 0.8% |
| 226 | 0.8% | |
| to | 162 | 0.6% |
| la | 142 | 0.5% |
| 3 | 104 | 0.4% |
| Other values (9629) | 22756 |
Most occurring characters
| Value | Count | Frequency (%) |
| 17714 | 11.3% | |
| e | 14622 | 9.3% |
| a | 9276 | 5.9% |
| o | 8352 | 5.3% |
| n | 7768 | 5.0% |
| r | 7701 | 4.9% |
| i | 7462 | 4.8% |
| t | 6900 | 4.4% |
| s | 5742 | 3.7% |
| l | 4893 | 3.1% |
| Other values (2085) | 66362 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 101826 | |
| Uppercase Letter | 22487 | 14.3% |
| Space Separator | 17734 | 11.3% |
| Other Letter | 9891 | 6.3% |
| Other Punctuation | 2519 | 1.6% |
| Decimal Number | 1257 | 0.8% |
| Dash Punctuation | 310 | 0.2% |
| Modifier Letter | 290 | 0.2% |
| Nonspacing Mark | 184 | 0.1% |
| Spacing Mark | 116 | 0.1% |
| Other values (11) | 178 | 0.1% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| の | 277 | 2.8% |
| ン | 258 | 2.6% |
| ラ | 140 | 1.4% |
| ス | 118 | 1.2% |
| ド | 114 | 1.2% |
| ル | 111 | 1.1% |
| ト | 97 | 1.0% |
| 場 | 93 | 0.9% |
| 劇 | 93 | 0.9% |
| 版 | 92 | 0.9% |
| Other values (1738) | 8498 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 14622 | |
| a | 9276 | 9.1% |
| o | 8352 | 8.2% |
| n | 7768 | 7.6% |
| r | 7701 | 7.6% |
| i | 7462 | 7.3% |
| t | 6900 | 6.8% |
| s | 5742 | 5.6% |
| l | 4893 | 4.8% |
| h | 4891 | 4.8% |
| Other values (113) | 24219 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2862 | 12.7% |
| S | 1877 | 8.3% |
| M | 1454 | 6.5% |
| B | 1390 | 6.2% |
| A | 1322 | 5.9% |
| D | 1260 | 5.6% |
| C | 1230 | 5.5% |
| L | 1160 | 5.2% |
| P | 1054 | 4.7% |
| H | 993 | 4.4% |
| Other values (66) | 7885 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ్ | 15 | 8.2% |
| े | 15 | 8.2% |
| ั | 14 | 7.6% |
| ् | 11 | 6.0% |
| ి | 10 | 5.4% |
| ं | 10 | 5.4% |
| ์ | 10 | 5.4% |
| ้ | 9 | 4.9% |
| ా | 8 | 4.3% |
| ิ | 7 | 3.8% |
| Other values (28) | 75 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1095 | |
| ' | 433 | 17.2% |
| . | 270 | 10.7% |
| ! | 181 | 7.2% |
| , | 162 | 6.4% |
| & | 131 | 5.2% |
| ・ | 59 | 2.3% |
| / | 40 | 1.6% |
| ? | 34 | 1.3% |
| : | 25 | 1.0% |
| Other values (18) | 89 | 3.5% |
Spacing Mark
| Value | Count | Frequency (%) |
| ा | 43 | |
| ी | 15 | 12.9% |
| ि | 12 | 10.3% |
| ు | 9 | 7.8% |
| ం | 7 | 6.0% |
| ி | 6 | 5.2% |
| ি | 6 | 5.2% |
| ो | 5 | 4.3% |
| া | 3 | 2.6% |
| ு | 2 | 1.7% |
| Other values (7) | 8 | 6.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 401 | |
| 3 | 210 | |
| 1 | 171 | |
| 0 | 149 | 11.9% |
| 4 | 86 | 6.8% |
| 5 | 57 | 4.5% |
| 9 | 53 | 4.2% |
| 7 | 44 | 3.5% |
| 6 | 41 | 3.3% |
| 8 | 37 | 2.9% |
| Other values (4) | 8 | 0.6% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 30 | |
| × | 6 | 12.2% |
| + | 4 | 8.2% |
| ~ | 4 | 8.2% |
| | | 3 | 6.1% |
| + | 1 | 2.0% |
| ∞ | 1 | 2.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 12 | |
| [ | 10 | |
| 「 | 8 | |
| ( | 7 | |
| 〈 | 3 | 7.1% |
| 『 | 1 | 2.4% |
| 【 | 1 | 2.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 12 | |
| ] | 10 | |
| 」 | 8 | |
| ) | 7 | |
| 〉 | 3 | 7.1% |
| 』 | 1 | 2.4% |
| 】 | 1 | 2.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 294 | |
| 〜 | 10 | 3.2% |
| ― | 2 | 0.6% |
| – | 2 | 0.6% |
| - | 2 | 0.6% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 6 | |
| ² | 2 | 18.2% |
| ³ | 2 | 18.2% |
| ⅓ | 1 | 9.1% |
Modifier Letter
| Value | Count | Frequency (%) |
| ー | 287 | |
| 々 | 2 | 0.7% |
| ʻ | 1 | 0.3% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 8 | |
| ” | 1 | 10.0% |
| » | 1 | 10.0% |
Other Symbol
| Value | Count | Frequency (%) |
| ☆ | 6 | |
| ° | 1 | 12.5% |
| △ | 1 | 12.5% |
Letter Number
| Value | Count | Frequency (%) |
| Ⅱ | 3 | |
| Ⅰ | 2 | |
| Ⅲ | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 17714 | ||
| 20 | 0.1% |
Format
| Value | Count | Frequency (%) |
| | 2 | |
| | 1 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 2 | |
| $ | 1 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 1 | |
| « | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 123292 | |
| Common | 22272 | 14.2% |
| Han | 3728 | 2.4% |
| Katakana | 2604 | 1.7% |
| Hangul | 1610 | 1.0% |
| Hiragana | 1269 | 0.8% |
| Cyrillic | 989 | 0.6% |
| Devanagari | 337 | 0.2% |
| Thai | 323 | 0.2% |
| Telugu | 138 | 0.1% |
| Other values (7) | 230 | 0.1% |
Most frequent character per script
Han
| Value | Count | Frequency (%) |
| 場 | 93 | 2.5% |
| 劇 | 93 | 2.5% |
| 版 | 92 | 2.5% |
| 女 | 50 | 1.3% |
| 大 | 41 | 1.1% |
| 之 | 41 | 1.1% |
| 人 | 37 | 1.0% |
| 戦 | 33 | 0.9% |
| 神 | 30 | 0.8% |
| 王 | 27 | 0.7% |
| Other values (1040) | 3191 |
Hangul
| Value | Count | Frequency (%) |
| 의 | 61 | 3.8% |
| 마 | 34 | 2.1% |
| 이 | 28 | 1.7% |
| 한 | 27 | 1.7% |
| 사 | 26 | 1.6% |
| 엄 | 25 | 1.6% |
| 스 | 24 | 1.5% |
| 아 | 24 | 1.5% |
| 친 | 22 | 1.4% |
| 여 | 21 | 1.3% |
| Other values (383) | 1318 |
Latin
| Value | Count | Frequency (%) |
| e | 14622 | 11.9% |
| a | 9276 | 7.5% |
| o | 8352 | 6.8% |
| n | 7768 | 6.3% |
| r | 7701 | 6.2% |
| i | 7462 | 6.1% |
| t | 6900 | 5.6% |
| s | 5742 | 4.7% |
| l | 4893 | 4.0% |
| h | 4891 | 4.0% |
| Other values (113) | 45685 |
Common
| Value | Count | Frequency (%) |
| 17714 | ||
| : | 1095 | 4.9% |
| ' | 433 | 1.9% |
| 2 | 401 | 1.8% |
| - | 294 | 1.3% |
| ー | 287 | 1.3% |
| . | 270 | 1.2% |
| 3 | 210 | 0.9% |
| ! | 181 | 0.8% |
| 1 | 171 | 0.8% |
| Other values (75) | 1216 | 5.5% |
Katakana
| Value | Count | Frequency (%) |
| ン | 258 | 9.9% |
| ラ | 140 | 5.4% |
| ス | 118 | 4.5% |
| ド | 114 | 4.4% |
| ル | 111 | 4.3% |
| ト | 97 | 3.7% |
| イ | 86 | 3.3% |
| ア | 75 | 2.9% |
| ッ | 68 | 2.6% |
| リ | 66 | 2.5% |
| Other values (69) | 1471 |
Hiragana
| Value | Count | Frequency (%) |
| の | 277 | |
| と | 61 | 4.8% |
| ん | 49 | 3.9% |
| た | 47 | 3.7% |
| い | 43 | 3.4% |
| る | 40 | 3.2% |
| か | 39 | 3.1% |
| を | 36 | 2.8% |
| し | 35 | 2.8% |
| ら | 34 | 2.7% |
| Other values (58) | 608 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 94 | 9.5% |
| о | 85 | 8.6% |
| е | 73 | 7.4% |
| и | 68 | 6.9% |
| р | 67 | 6.8% |
| н | 61 | 6.2% |
| т | 42 | 4.2% |
| к | 41 | 4.1% |
| л | 37 | 3.7% |
| с | 32 | 3.2% |
| Other values (46) | 389 |
Devanagari
| Value | Count | Frequency (%) |
| ा | 43 | 12.8% |
| र | 18 | 5.3% |
| न | 17 | 5.0% |
| म | 16 | 4.7% |
| े | 15 | 4.5% |
| ी | 15 | 4.5% |
| ल | 14 | 4.2% |
| द | 13 | 3.9% |
| ग | 13 | 3.9% |
| ि | 12 | 3.6% |
| Other values (38) | 161 |
Thai
| Value | Count | Frequency (%) |
| า | 25 | 7.7% |
| ก | 23 | 7.1% |
| ร | 21 | 6.5% |
| อ | 17 | 5.3% |
| เ | 15 | 4.6% |
| ม | 14 | 4.3% |
| ั | 14 | 4.3% |
| น | 13 | 4.0% |
| ต | 13 | 4.0% |
| ง | 11 | 3.4% |
| Other values (38) | 157 |
Telugu
| Value | Count | Frequency (%) |
| ్ | 15 | 10.9% |
| ర | 12 | 8.7% |
| ి | 10 | 7.2% |
| ు | 9 | 6.5% |
| ా | 8 | 5.8% |
| గ | 8 | 5.8% |
| ం | 7 | 5.1% |
| బ | 6 | 4.3% |
| ల | 5 | 3.6% |
| న | 5 | 3.6% |
| Other values (26) | 53 |
Arabic
| Value | Count | Frequency (%) |
| ا | 12 | |
| ر | 7 | 8.9% |
| م | 6 | 7.6% |
| ل | 6 | 7.6% |
| س | 5 | 6.3% |
| ف | 4 | 5.1% |
| ب | 4 | 5.1% |
| و | 3 | 3.8% |
| ن | 3 | 3.8% |
| ه | 3 | 3.8% |
| Other values (16) | 26 |
Greek
| Value | Count | Frequency (%) |
| α | 4 | 10.3% |
| ν | 3 | 7.7% |
| μ | 3 | 7.7% |
| ο | 3 | 7.7% |
| ς | 3 | 7.7% |
| υ | 2 | 5.1% |
| ρ | 2 | 5.1% |
| η | 2 | 5.1% |
| έ | 2 | 5.1% |
| λ | 2 | 5.1% |
| Other values (13) | 13 |
Bengali
| Value | Count | Frequency (%) |
| ি | 6 | |
| ম | 3 | 8.6% |
| জ | 3 | 8.6% |
| া | 3 | 8.6% |
| র | 3 | 8.6% |
| ত | 2 | 5.7% |
| ক | 2 | 5.7% |
| গ | 1 | 2.9% |
| ল | 1 | 2.9% |
| স | 1 | 2.9% |
| Other values (10) | 10 |
Tamil
| Value | Count | Frequency (%) |
| ் | 6 | |
| ி | 6 | |
| க | 4 | |
| த | 3 | |
| ர | 3 | |
| ம | 2 | 5.4% |
| ு | 2 | 5.4% |
| ந | 2 | 5.4% |
| ல | 1 | 2.7% |
| ய | 1 | 2.7% |
| Other values (7) | 7 |
Kannada
| Value | Count | Frequency (%) |
| ್ | 3 | |
| ಎ | 2 | |
| ಫ | 2 | |
| ಿ | 2 | |
| ಜ | 2 | |
| ೆ | 2 | |
| ಕ | 2 | |
| ಯ | 2 | |
| ೧ | 1 | 4.8% |
| ಾ | 1 | 4.8% |
| Other values (2) | 2 |
Malayalam
| Value | Count | Frequency (%) |
| മ | 2 | |
| ി | 2 | |
| ന | 2 | |
| ് | 1 | |
| ൽ | 1 | |
| ു | 1 | |
| ര | 1 | |
| ള | 1 |
Inherited
| Value | Count | Frequency (%) |
| ゙ | 5 | |
| ̀ | 2 | 25.0% |
| | 1 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 144563 | |
| CJK | 3726 | 2.4% |
| Katakana | 2950 | 1.9% |
| Hangul | 1610 | 1.0% |
| Hiragana | 1274 | 0.8% |
| Cyrillic | 989 | 0.6% |
| None | 658 | 0.4% |
| Devanagari | 337 | 0.2% |
| Thai | 323 | 0.2% |
| Telugu | 138 | 0.1% |
| Other values (14) | 224 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 17714 | 12.3% | |
| e | 14622 | 10.1% |
| a | 9276 | 6.4% |
| o | 8352 | 5.8% |
| n | 7768 | 5.4% |
| r | 7701 | 5.3% |
| i | 7462 | 5.2% |
| t | 6900 | 4.8% |
| s | 5742 | 4.0% |
| l | 4893 | 3.4% |
| Other values (77) | 54133 |
Katakana
| Value | Count | Frequency (%) |
| ー | 287 | 9.7% |
| ン | 258 | 8.7% |
| ラ | 140 | 4.7% |
| ス | 118 | 4.0% |
| ド | 114 | 3.9% |
| ル | 111 | 3.8% |
| ト | 97 | 3.3% |
| イ | 86 | 2.9% |
| ア | 75 | 2.5% |
| ッ | 68 | 2.3% |
| Other values (71) | 1596 |
Hiragana
| Value | Count | Frequency (%) |
| の | 277 | |
| と | 61 | 4.8% |
| ん | 49 | 3.8% |
| た | 47 | 3.7% |
| い | 43 | 3.4% |
| る | 40 | 3.1% |
| か | 39 | 3.1% |
| を | 36 | 2.8% |
| し | 35 | 2.7% |
| ら | 34 | 2.7% |
| Other values (59) | 613 |
None
| Value | Count | Frequency (%) |
| é | 103 | 15.7% |
| ó | 36 | 5.5% |
| è | 35 | 5.3% |
| ~ | 30 | 4.6% |
| í | 25 | 3.8% |
| : | 25 | 3.8% |
| 20 | 3.0% | |
| á | 20 | 3.0% |
| ! | 20 | 3.0% |
| à | 19 | 2.9% |
| Other values (115) | 325 |
Cyrillic
| Value | Count | Frequency (%) |
| а | 94 | 9.5% |
| о | 85 | 8.6% |
| е | 73 | 7.4% |
| и | 68 | 6.9% |
| р | 67 | 6.8% |
| н | 61 | 6.2% |
| т | 42 | 4.2% |
| к | 41 | 4.1% |
| л | 37 | 3.7% |
| с | 32 | 3.2% |
| Other values (46) | 389 |
CJK
| Value | Count | Frequency (%) |
| 場 | 93 | 2.5% |
| 劇 | 93 | 2.5% |
| 版 | 92 | 2.5% |
| 女 | 50 | 1.3% |
| 大 | 41 | 1.1% |
| 之 | 41 | 1.1% |
| 人 | 37 | 1.0% |
| 戦 | 33 | 0.9% |
| 神 | 30 | 0.8% |
| 王 | 27 | 0.7% |
| Other values (1039) | 3189 |
Hangul
| Value | Count | Frequency (%) |
| 의 | 61 | 3.8% |
| 마 | 34 | 2.1% |
| 이 | 28 | 1.7% |
| 한 | 27 | 1.7% |
| 사 | 26 | 1.6% |
| 엄 | 25 | 1.6% |
| 스 | 24 | 1.5% |
| 아 | 24 | 1.5% |
| 친 | 22 | 1.4% |
| 여 | 21 | 1.3% |
| Other values (383) | 1318 |
Devanagari
| Value | Count | Frequency (%) |
| ा | 43 | 12.8% |
| र | 18 | 5.3% |
| न | 17 | 5.0% |
| म | 16 | 4.7% |
| े | 15 | 4.5% |
| ी | 15 | 4.5% |
| ल | 14 | 4.2% |
| द | 13 | 3.9% |
| ग | 13 | 3.9% |
| ि | 12 | 3.6% |
| Other values (38) | 161 |
Thai
| Value | Count | Frequency (%) |
| า | 25 | 7.7% |
| ก | 23 | 7.1% |
| ร | 21 | 6.5% |
| อ | 17 | 5.3% |
| เ | 15 | 4.6% |
| ม | 14 | 4.3% |
| ั | 14 | 4.3% |
| น | 13 | 4.0% |
| ต | 13 | 4.0% |
| ง | 11 | 3.4% |
| Other values (38) | 157 |
Telugu
| Value | Count | Frequency (%) |
| ్ | 15 | 10.9% |
| ర | 12 | 8.7% |
| ి | 10 | 7.2% |
| ు | 9 | 6.5% |
| ా | 8 | 5.8% |
| గ | 8 | 5.8% |
| ం | 7 | 5.1% |
| బ | 6 | 4.3% |
| ల | 5 | 3.6% |
| న | 5 | 3.6% |
| Other values (26) | 53 |
Arabic
| Value | Count | Frequency (%) |
| ا | 12 | |
| ر | 7 | 8.9% |
| م | 6 | 7.6% |
| ل | 6 | 7.6% |
| س | 5 | 6.3% |
| ف | 4 | 5.1% |
| ب | 4 | 5.1% |
| و | 3 | 3.8% |
| ن | 3 | 3.8% |
| ه | 3 | 3.8% |
| Other values (16) | 26 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 8 | |
| ― | 2 | 10.5% |
| – | 2 | 10.5% |
| | 2 | 10.5% |
| … | 1 | 5.3% |
| ‧ | 1 | 5.3% |
| “ | 1 | 5.3% |
| ” | 1 | 5.3% |
| | 1 | 5.3% |
Tamil
| Value | Count | Frequency (%) |
| ் | 6 | |
| ி | 6 | |
| க | 4 | |
| த | 3 | |
| ர | 3 | |
| ம | 2 | 5.4% |
| ு | 2 | 5.4% |
| ந | 2 | 5.4% |
| ல | 1 | 2.7% |
| ய | 1 | 2.7% |
| Other values (7) | 7 |
Misc Symbols
| Value | Count | Frequency (%) |
| ☆ | 6 |
Bengali
| Value | Count | Frequency (%) |
| ি | 6 | |
| ম | 3 | 8.6% |
| জ | 3 | 8.6% |
| া | 3 | 8.6% |
| র | 3 | 8.6% |
| ত | 2 | 5.7% |
| ক | 2 | 5.7% |
| গ | 1 | 2.9% |
| ল | 1 | 2.9% |
| স | 1 | 2.9% |
| Other values (10) | 10 |
Kannada
| Value | Count | Frequency (%) |
| ್ | 3 | |
| ಎ | 2 | |
| ಫ | 2 | |
| ಿ | 2 | |
| ಜ | 2 | |
| ೆ | 2 | |
| ಕ | 2 | |
| ಯ | 2 | |
| ೧ | 1 | 4.8% |
| ಾ | 1 | 4.8% |
| Other values (2) | 2 |
Number Forms
| Value | Count | Frequency (%) |
| Ⅱ | 3 | |
| Ⅰ | 2 | |
| Ⅲ | 2 | |
| ⅓ | 1 | 12.5% |
Malayalam
| Value | Count | Frequency (%) |
| മ | 2 | |
| ി | 2 | |
| ന | 2 | |
| ് | 1 | |
| ൽ | 1 | |
| ു | 1 | |
| ര | 1 | |
| ള | 1 |
Diacriticals
| Value | Count | Frequency (%) |
| ̀ | 2 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ọ | 2 |
CJK Compat Forms
| Value | Count | Frequency (%) |
| ︰ | 1 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 1 |
Math Operators
| Value | Count | Frequency (%) |
| ∞ | 1 |
Geometric Shapes
| Value | Count | Frequency (%) |
| △ | 1 |
Overview
Text
| Distinct | 9946 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 51 |
| Missing (%) | 0.5% |
| Memory size | 156.2 KiB |
Length
| Max length | 1000 |
|---|---|
| Median length | 648 |
| Mean length | 276.2556 |
| Min length | 11 |
Characters and Unicode
| Total characters | 2748467 |
|---|---|
| Distinct characters | 144 |
| Distinct categories | 19 ? |
| Distinct scripts | 3 ? |
| Distinct blocks | 7 ? |
Unique
| Unique | 9943 ? |
|---|---|
| Unique (%) | 99.9% |
Sample
| 1st row | Armed with every weapon they can get their hands on and the skills to use them, The Expendables are the world’s last line of defense and the team that gets called when all other options are off the table. But new team members with new styles and tactics are going to give “new blood” a whole new meaning. |
|---|---|
| 2nd row | Robert McCall finds himself at home in Southern Italy but he discovers his friends are under the control of local crime bosses. As events turn deadly, McCall knows what he has to do: become his friends' protector by taking on the mafia. |
| 3rd row | In 1980s Hollywood, action star Johnny Cage is looking to become an A-list actor. But when his costar, Jennifer, goes missing from set, Johnny finds himself thrust into a world filled with shadows, danger, and deceit. As he embarks on a bloody journey, Johnny quickly discovers the City of Angels has more than a few devils in its midst. |
| 4th row | Ethan Hunt and his IMF team embark on their most dangerous mission yet: To track down a terrifying new weapon that threatens all of humanity before it falls into the wrong hands. With control of the future and the world's fate at stake and dark forces from Ethan's past closing in, a deadly race around the globe begins. Confronted by a mysterious, all-powerful enemy, Ethan must consider that nothing can matter more than his mission—not even the lives of those he cares about most. |
| 5th row | A young pregnant woman named Mia escapes from a country at war by hiding in a maritime container aboard a cargo ship. After a violent storm, Mia gives birth to the child while lost at sea, where she must fight to survive. |
| Value | Count | Frequency (%) |
| the | 25823 | 5.5% |
| a | 20517 | 4.4% |
| to | 15672 | 3.3% |
| and | 13764 | 2.9% |
| of | 12638 | 2.7% |
| in | 8235 | 1.8% |
| his | 7119 | 1.5% |
| is | 6148 | 1.3% |
| her | 4789 | 1.0% |
| with | 4711 | 1.0% |
| Other values (33420) | 349955 |
Most occurring characters
| Value | Count | Frequency (%) |
| 459804 | ||
| e | 268366 | 9.8% |
| t | 182946 | 6.7% |
| a | 179364 | 6.5% |
| i | 159531 | 5.8% |
| n | 159509 | 5.8% |
| o | 157954 | 5.7% |
| s | 147563 | 5.4% |
| r | 145146 | 5.3% |
| h | 116848 | 4.3% |
| Other values (134) | 771436 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2147996 | |
| Space Separator | 459827 | 16.7% |
| Uppercase Letter | 67665 | 2.5% |
| Other Punctuation | 56668 | 2.1% |
| Dash Punctuation | 8359 | 0.3% |
| Decimal Number | 5892 | 0.2% |
| Final Punctuation | 1175 | < 0.1% |
| Open Punctuation | 312 | < 0.1% |
| Close Punctuation | 312 | < 0.1% |
| Initial Punctuation | 158 | < 0.1% |
| Other values (9) | 103 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 268366 | |
| t | 182946 | 8.5% |
| a | 179364 | 8.4% |
| i | 159531 | 7.4% |
| n | 159509 | 7.4% |
| o | 157954 | 7.4% |
| s | 147563 | 6.9% |
| r | 145146 | 6.8% |
| h | 116848 | 5.4% |
| l | 90119 | 4.2% |
| Other values (47) | 540650 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 8120 | 12.0% |
| T | 5570 | 8.2% |
| S | 5511 | 8.1% |
| B | 4252 | 6.3% |
| C | 3995 | 5.9% |
| M | 3993 | 5.9% |
| W | 3803 | 5.6% |
| H | 3300 | 4.9% |
| D | 2843 | 4.2% |
| I | 2735 | 4.0% |
| Other values (21) | 23543 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 25528 | |
| . | 22381 | |
| ' | 5636 | 9.9% |
| " | 1356 | 2.4% |
| : | 582 | 1.0% |
| ? | 407 | 0.7% |
| ! | 350 | 0.6% |
| ; | 176 | 0.3% |
| … | 112 | 0.2% |
| / | 66 | 0.1% |
| Other values (5) | 74 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1394 | |
| 0 | 1194 | |
| 9 | 796 | |
| 2 | 623 | |
| 5 | 347 | 5.9% |
| 3 | 332 | 5.6% |
| 8 | 331 | 5.6% |
| 7 | 310 | 5.3% |
| 6 | 293 | 5.0% |
| 4 | 272 | 4.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7471 | |
| — | 594 | 7.1% |
| – | 294 | 3.5% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 1052 | |
| ” | 121 | 10.3% |
| » | 2 | 0.2% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 120 | |
| ‘ | 36 | 22.8% |
| « | 2 | 1.3% |
Format
| Value | Count | Frequency (%) |
| | 2 | |
| | 1 | |
| | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 459804 | ||
| 23 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 308 | |
| [ | 4 | 1.3% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 308 | |
| ] | 4 | 1.3% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 57 | |
| £ | 1 | 1.7% |
Control
| Value | Count | Frequency (%) |
| 14 | ||
| 1 | 6.7% |
Other Symbol
| Value | Count | Frequency (%) |
| ™ | 10 | |
| ® | 2 | 16.7% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 2 | |
| ̈ | 1 |
Other Number
| Value | Count | Frequency (%) |
| ¹ | 1 | |
| ² | 1 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 4 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 3 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2215661 | |
| Common | 532803 | 19.4% |
| Inherited | 3 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 268366 | |
| t | 182946 | 8.3% |
| a | 179364 | 8.1% |
| i | 159531 | 7.2% |
| n | 159509 | 7.2% |
| o | 157954 | 7.1% |
| s | 147563 | 6.7% |
| r | 145146 | 6.6% |
| h | 116848 | 5.3% |
| l | 90119 | 4.1% |
| Other values (78) | 608315 |
Common
| Value | Count | Frequency (%) |
| 459804 | ||
| , | 25528 | 4.8% |
| . | 22381 | 4.2% |
| - | 7471 | 1.4% |
| ' | 5636 | 1.1% |
| 1 | 1394 | 0.3% |
| " | 1356 | 0.3% |
| 0 | 1194 | 0.2% |
| ’ | 1052 | 0.2% |
| 9 | 796 | 0.1% |
| Other values (44) | 6191 | 1.2% |
Inherited
| Value | Count | Frequency (%) |
| ́ | 2 | |
| ̈ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2745676 | |
| Punctuation | 2336 | 0.1% |
| None | 438 | < 0.1% |
| Letterlike Symbols | 10 | < 0.1% |
| Modifier Letters | 3 | < 0.1% |
| Diacriticals | 3 | < 0.1% |
| Alphabetic PF | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 459804 | ||
| e | 268366 | 9.8% |
| t | 182946 | 6.7% |
| a | 179364 | 6.5% |
| i | 159531 | 5.8% |
| n | 159509 | 5.8% |
| o | 157954 | 5.8% |
| s | 147563 | 5.4% |
| r | 145146 | 5.3% |
| h | 116848 | 4.3% |
| Other values (75) | 768645 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 1052 | |
| — | 594 | |
| – | 294 | 12.6% |
| ” | 121 | 5.2% |
| “ | 120 | 5.1% |
| … | 112 | 4.8% |
| ‘ | 36 | 1.5% |
| • | 3 | 0.1% |
| | 2 | 0.1% |
| | 1 | < 0.1% |
None
| Value | Count | Frequency (%) |
| é | 204 | |
| ō | 25 | 5.7% |
| í | 25 | 5.7% |
| 23 | 5.3% | |
| á | 22 | 5.0% |
| è | 16 | 3.7% |
| ū | 11 | 2.5% |
| ï | 11 | 2.5% |
| ä | 9 | 2.1% |
| ç | 9 | 2.1% |
| Other values (33) | 83 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 10 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 3 |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 2 | |
| ̈ | 1 |
Alphabetic PF
| Value | Count | Frequency (%) |
| fi | 1 |
Popularity
Real number (ℝ)
SKEWED 
| Distinct | 8069 |
|---|---|
| Distinct (%) | 80.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.248141 |
| Minimum | 13.049 |
|---|---|
| Maximum | 3741.062 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 13.049 |
|---|---|
| 5-th percentile | 13.47095 |
| Q1 | 15.59075 |
| median | 20.096 |
| Q3 | 30.3105 |
| 95-th percentile | 79.9818 |
| Maximum | 3741.062 |
| Range | 3728.013 |
| Interquartile range (IQR) | 14.71975 |
Descriptive statistics
| Standard deviation | 84.332838 |
|---|---|
| Coefficient of variation (CV) | 2.4624063 |
| Kurtosis | 603.66538 |
| Mean | 34.248141 |
| Median Absolute Deviation (MAD) | 5.4355 |
| Skewness | 20.169371 |
| Sum | 342481.41 |
| Variance | 7112.0276 |
| Monotonicity | Decreasing |
| Value | Count | Frequency (%) |
| 13.297 | 6 | 0.1% |
| 15.499 | 6 | 0.1% |
| 16.146 | 6 | 0.1% |
| 13.112 | 5 | 0.1% |
| 13.572 | 5 | 0.1% |
| 13.84 | 5 | 0.1% |
| 14.739 | 5 | 0.1% |
| 13.67 | 5 | 0.1% |
| 14.76 | 5 | 0.1% |
| 13.287 | 5 | 0.1% |
| Other values (8059) | 9947 |
| Value | Count | Frequency (%) |
| 13.049 | 3 | |
| 13.05 | 1 | < 0.1% |
| 13.051 | 3 | |
| 13.052 | 2 | |
| 13.054 | 3 | |
| 13.055 | 2 | |
| 13.056 | 1 | < 0.1% |
| 13.057 | 2 | |
| 13.059 | 1 | < 0.1% |
| 13.063 | 2 |
| Value | Count | Frequency (%) |
| 3741.062 | 1 | |
| 2471.515 | 1 | |
| 2223.43 | 1 | |
| 2032.927 | 1 | |
| 1627.678 | 1 | |
| 1594.559 | 1 | |
| 1521.075 | 1 | |
| 1469.177 | 1 | |
| 1315.518 | 1 | |
| 1304.978 | 1 |
ReleaseDate
Date
| Distinct | 5947 |
|---|---|
| Distinct (%) | 59.6% |
| Missing | 21 |
| Missing (%) | 0.2% |
| Memory size | 156.2 KiB |
| Minimum | 1902-04-17 00:00:00 |
|---|---|
| Maximum | 2027-05-05 00:00:00 |
Title
Text
| Distinct | 9637 |
|---|---|
| Distinct (%) | 96.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.2 KiB |
Length
| Max length | 104 |
|---|---|
| Median length | 72 |
| Mean length | 17.0992 |
| Min length | 1 |
Characters and Unicode
| Total characters | 170992 |
|---|---|
| Distinct characters | 155 |
| Distinct categories | 18 ? |
| Distinct scripts | 5 ? |
| Distinct blocks | 9 ? |
Unique
| Unique | 9312 ? |
|---|---|
| Unique (%) | 93.1% |
Sample
| 1st row | Expend4bles |
|---|---|
| 2nd row | The Equalizer 3 |
| 3rd row | Mortal Kombat Legends: Cage Match |
| 4th row | Mission: Impossible - Dead Reckoning Part One |
| 5th row | Nowhere |
| Value | Count | Frequency (%) |
| the | 3405 | 11.2% |
| of | 1037 | 3.4% |
| a | 413 | 1.4% |
| and | 338 | 1.1% |
| 2 | 300 | 1.0% |
| in | 299 | 1.0% |
| 212 | 0.7% | |
| to | 210 | 0.7% |
| movie | 167 | 0.5% |
| 3 | 122 | 0.4% |
| Other values (7795) | 23908 |
Most occurring characters
| Value | Count | Frequency (%) |
| 20411 | 11.9% | |
| e | 17408 | 10.2% |
| a | 10643 | 6.2% |
| o | 10266 | 6.0% |
| n | 9243 | 5.4% |
| r | 9058 | 5.3% |
| i | 8877 | 5.2% |
| t | 8378 | 4.9% |
| s | 6611 | 3.9% |
| h | 6310 | 3.7% |
| Other values (145) | 63787 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 119406 | |
| Uppercase Letter | 26585 | 15.5% |
| Space Separator | 20413 | 11.9% |
| Other Punctuation | 2857 | 1.7% |
| Decimal Number | 1286 | 0.8% |
| Dash Punctuation | 341 | 0.2% |
| Other Letter | 32 | < 0.1% |
| Open Punctuation | 21 | < 0.1% |
| Close Punctuation | 21 | < 0.1% |
| Other Number | 11 | < 0.1% |
| Other values (8) | 19 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 17408 | |
| a | 10643 | 8.9% |
| o | 10266 | 8.6% |
| n | 9243 | 7.7% |
| r | 9058 | 7.6% |
| i | 8877 | 7.4% |
| t | 8378 | 7.0% |
| s | 6611 | 5.5% |
| h | 6310 | 5.3% |
| l | 5621 | 4.7% |
| Other values (31) | 26991 |
Other Letter
| Value | Count | Frequency (%) |
| の | 2 | 6.2% |
| 後 | 1 | 3.1% |
| 午 | 1 | 3.1% |
| 田 | 1 | 3.1% |
| 中 | 1 | 3.1% |
| 瞳 | 1 | 3.1% |
| 爆 | 1 | 3.1% |
| 乳 | 1 | 3.1% |
| 衝 | 1 | 3.1% |
| 王 | 1 | 3.1% |
| Other values (21) | 21 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 3506 | 13.2% |
| S | 2362 | 8.9% |
| M | 1816 | 6.8% |
| B | 1705 | 6.4% |
| D | 1521 | 5.7% |
| A | 1503 | 5.7% |
| C | 1489 | 5.6% |
| P | 1252 | 4.7% |
| L | 1219 | 4.6% |
| H | 1125 | 4.2% |
| Other values (20) | 9087 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1476 | |
| ' | 492 | 17.2% |
| . | 282 | 9.9% |
| , | 192 | 6.7% |
| ! | 169 | 5.9% |
| & | 141 | 4.9% |
| / | 39 | 1.4% |
| ? | 36 | 1.3% |
| * | 8 | 0.3% |
| " | 6 | 0.2% |
| Other values (9) | 16 | 0.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 408 | |
| 3 | 217 | |
| 1 | 178 | |
| 0 | 162 | 12.6% |
| 4 | 90 | 7.0% |
| 5 | 56 | 4.4% |
| 9 | 56 | 4.4% |
| 7 | 44 | 3.4% |
| 6 | 39 | 3.0% |
| 8 | 36 | 2.8% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 6 | |
| ³ | 2 | 18.2% |
| ⁴ | 1 | 9.1% |
| ² | 1 | 9.1% |
| ⅓ | 1 | 9.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 326 | |
| – | 11 | 3.2% |
| — | 4 | 1.2% |
Space Separator
| Value | Count | Frequency (%) |
| 20411 | ||
| 2 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 16 | |
| [ | 5 | 23.8% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 16 | |
| ] | 5 | 23.8% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 4 | |
| | | 1 | 20.0% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 2 | |
| $ | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 4 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ̀ | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Format
| Value | Count | Frequency (%) |
| | 1 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 1 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 145991 | |
| Common | 24967 | 14.6% |
| Han | 26 | < 0.1% |
| Hiragana | 6 | < 0.1% |
| Inherited | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 17408 | 11.9% |
| a | 10643 | 7.3% |
| o | 10266 | 7.0% |
| n | 9243 | 6.3% |
| r | 9058 | 6.2% |
| i | 8877 | 6.1% |
| t | 8378 | 5.7% |
| s | 6611 | 4.5% |
| h | 6310 | 4.3% |
| l | 5621 | 3.9% |
| Other values (61) | 53576 |
Common
| Value | Count | Frequency (%) |
| 20411 | ||
| : | 1476 | 5.9% |
| ' | 492 | 2.0% |
| 2 | 408 | 1.6% |
| - | 326 | 1.3% |
| . | 282 | 1.1% |
| 3 | 217 | 0.9% |
| , | 192 | 0.8% |
| 1 | 178 | 0.7% |
| ! | 169 | 0.7% |
| Other values (42) | 816 | 3.3% |
Han
| Value | Count | Frequency (%) |
| 後 | 1 | 3.8% |
| 午 | 1 | 3.8% |
| 田 | 1 | 3.8% |
| 中 | 1 | 3.8% |
| 瞳 | 1 | 3.8% |
| 爆 | 1 | 3.8% |
| 乳 | 1 | 3.8% |
| 衝 | 1 | 3.8% |
| 王 | 1 | 3.8% |
| 力 | 1 | 3.8% |
| Other values (16) | 16 |
Hiragana
| Value | Count | Frequency (%) |
| の | 2 | |
| え | 1 | |
| な | 1 | |
| い | 1 | |
| り | 1 |
Inherited
| Value | Count | Frequency (%) |
| ̀ | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 170815 | |
| None | 117 | 0.1% |
| CJK | 26 | < 0.1% |
| Punctuation | 22 | < 0.1% |
| Hiragana | 6 | < 0.1% |
| Diacriticals | 2 | < 0.1% |
| Latin Ext Additional | 2 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
| Number Forms | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 20411 | 11.9% | |
| e | 17408 | 10.2% |
| a | 10643 | 6.2% |
| o | 10266 | 6.0% |
| n | 9243 | 5.4% |
| r | 9058 | 5.3% |
| i | 8877 | 5.2% |
| t | 8378 | 4.9% |
| s | 6611 | 3.9% |
| h | 6310 | 3.7% |
| Other values (76) | 63610 |
None
| Value | Count | Frequency (%) |
| é | 49 | |
| ó | 8 | 6.8% |
| í | 6 | 5.1% |
| ½ | 6 | 5.1% |
| á | 5 | 4.3% |
| à | 4 | 3.4% |
| ¡ | 4 | 3.4% |
| ā | 3 | 2.6% |
| ñ | 3 | 2.6% |
| ¿ | 3 | 2.6% |
| Other values (19) | 26 |
Punctuation
| Value | Count | Frequency (%) |
| – | 11 | |
| — | 4 | 18.2% |
| ’ | 4 | 18.2% |
| 2 | 9.1% | |
| … | 1 | 4.5% |
Diacriticals
| Value | Count | Frequency (%) |
| ̀ | 2 |
Hiragana
| Value | Count | Frequency (%) |
| の | 2 | |
| え | 1 | |
| な | 1 | |
| い | 1 | |
| り | 1 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ọ | 2 |
CJK
| Value | Count | Frequency (%) |
| 後 | 1 | 3.8% |
| 午 | 1 | 3.8% |
| 田 | 1 | 3.8% |
| 中 | 1 | 3.8% |
| 瞳 | 1 | 3.8% |
| 爆 | 1 | 3.8% |
| 乳 | 1 | 3.8% |
| 衝 | 1 | 3.8% |
| 王 | 1 | 3.8% |
| 力 | 1 | 3.8% |
| Other values (16) | 16 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 1 |
Number Forms
| Value | Count | Frequency (%) |
| ⅓ | 1 |
VoteAverage
Real number (ℝ)
ZEROS 
| Distinct | 73 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.36797 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 261 |
| Zeros (%) | 2.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4.5 |
| Q1 | 6 |
| median | 6.6 |
| Q3 | 7.1 |
| 95-th percentile | 7.9 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 1.1 |
Descriptive statistics
| Standard deviation | 1.3807251 |
|---|---|
| Coefficient of variation (CV) | 0.21682343 |
| Kurtosis | 9.7538572 |
| Mean | 6.36797 |
| Median Absolute Deviation (MAD) | 0.6 |
| Skewness | -2.5954374 |
| Sum | 63679.7 |
| Variance | 1.9064017 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6.5 | 466 | 4.7% |
| 6.8 | 448 | 4.5% |
| 7 | 447 | 4.5% |
| 6.3 | 438 | 4.4% |
| 6.6 | 437 | 4.4% |
| 6.7 | 430 | 4.3% |
| 6.9 | 423 | 4.2% |
| 6.4 | 415 | 4.2% |
| 6.2 | 410 | 4.1% |
| 6.1 | 389 | 3.9% |
| Other values (63) | 5697 |
| Value | Count | Frequency (%) |
| 0 | 261 | |
| 1 | 10 | 0.1% |
| 1.5 | 1 | < 0.1% |
| 2 | 7 | 0.1% |
| 2.3 | 1 | < 0.1% |
| 2.5 | 1 | < 0.1% |
| 2.7 | 2 | < 0.1% |
| 2.8 | 1 | < 0.1% |
| 2.9 | 3 | < 0.1% |
| 3 | 10 | 0.1% |
| Value | Count | Frequency (%) |
| 10 | 10 | |
| 9.8 | 1 | < 0.1% |
| 9.5 | 2 | < 0.1% |
| 9.3 | 1 | < 0.1% |
| 9 | 8 | 0.1% |
| 8.9 | 1 | < 0.1% |
| 8.8 | 5 | 0.1% |
| 8.7 | 2 | < 0.1% |
| 8.6 | 6 | 0.1% |
| 8.5 | 21 |
VoteCount
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 3543 |
|---|---|
| Distinct (%) | 35.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1620.1538 |
| Minimum | 0 |
|---|---|
| Maximum | 34628 |
| Zeros | 260 |
| Zeros (%) | 2.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 168 |
| median | 563 |
| Q3 | 1665 |
| 95-th percentile | 6937.25 |
| Maximum | 34628 |
| Range | 34628 |
| Interquartile range (IQR) | 1497 |
Descriptive statistics
| Standard deviation | 2960.6426 |
|---|---|
| Coefficient of variation (CV) | 1.8273837 |
| Kurtosis | 21.895306 |
| Mean | 1620.1538 |
| Median Absolute Deviation (MAD) | 488 |
| Skewness | 4.0440426 |
| Sum | 16201538 |
| Variance | 8765404.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 260 | 2.6% |
| 1 | 72 | 0.7% |
| 2 | 60 | 0.6% |
| 3 | 49 | 0.5% |
| 5 | 46 | 0.5% |
| 4 | 45 | 0.4% |
| 8 | 39 | 0.4% |
| 7 | 36 | 0.4% |
| 6 | 33 | 0.3% |
| 10 | 30 | 0.3% |
| Other values (3533) | 9330 |
| Value | Count | Frequency (%) |
| 0 | 260 | |
| 1 | 72 | 0.7% |
| 2 | 60 | 0.6% |
| 3 | 49 | 0.5% |
| 4 | 45 | 0.4% |
| 5 | 46 | 0.5% |
| 6 | 33 | 0.3% |
| 7 | 36 | 0.4% |
| 8 | 39 | 0.4% |
| 9 | 27 | 0.3% |
| Value | Count | Frequency (%) |
| 34628 | 1 | |
| 32726 | 1 | |
| 30768 | 1 | |
| 29904 | 1 | |
| 29241 | 1 | |
| 28971 | 1 | |
| 27827 | 1 | |
| 27366 | 1 | |
| 26721 | 1 | |
| 26016 | 1 |
Budget
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 710 |
|---|---|
| Distinct (%) | 7.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20439089 |
| Minimum | 0 |
|---|---|
| Maximum | 4.6 × 108 |
| Zeros | 4472 |
| Zeros (%) | 44.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2130000 |
| Q3 | 25000000 |
| 95-th percentile | 1 × 108 |
| Maximum | 4.6 × 108 |
| Range | 4.6 × 108 |
| Interquartile range (IQR) | 25000000 |
Descriptive statistics
| Standard deviation | 38786607 |
|---|---|
| Coefficient of variation (CV) | 1.8976681 |
| Kurtosis | 13.84859 |
| Mean | 20439089 |
| Median Absolute Deviation (MAD) | 2130000 |
| Skewness | 3.2401366 |
| Sum | 2.0439089 × 1011 |
| Variance | 1.5044009 × 1015 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4472 | |
| 20000000 | 210 | 2.1% |
| 30000000 | 184 | 1.8% |
| 25000000 | 173 | 1.7% |
| 10000000 | 169 | 1.7% |
| 15000000 | 165 | 1.7% |
| 40000000 | 157 | 1.6% |
| 5000000 | 145 | 1.5% |
| 50000000 | 133 | 1.3% |
| 35000000 | 132 | 1.3% |
| Other values (700) | 4060 |
| Value | Count | Frequency (%) |
| 0 | 4472 | |
| 1 | 3 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 26 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 460000000 | 1 | < 0.1% |
| 379000000 | 1 | < 0.1% |
| 365000000 | 1 | < 0.1% |
| 356000000 | 1 | < 0.1% |
| 340000000 | 1 | < 0.1% |
| 300000000 | 4 | |
| 297000000 | 1 | < 0.1% |
| 294700000 | 1 | < 0.1% |
| 291000000 | 1 | < 0.1% |
| 274800000 | 1 | < 0.1% |
| Distinct | 8180 |
|---|---|
| Distinct (%) | 81.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.2 KiB |
Length
| Max length | 2591 |
|---|---|
| Median length | 1060 |
| Mean length | 324.7863 |
| Min length | 2 |
Characters and Unicode
| Total characters | 3247863 |
|---|---|
| Distinct characters | 205 |
| Distinct categories | 16 ? |
| Distinct scripts | 6 ? |
| Distinct blocks | 8 ? |
Unique
| Unique | 7740 ? |
|---|---|
| Unique (%) | 77.4% |
Sample
| 1st row | [{'id': 1020, 'logo_path': '/kuUIHNwMec4dwOLghDhhZJzHZTd.png', 'name': 'Millennium Media', 'origin_country': 'US'}, {'id': 48738, 'logo_path': None, 'name': 'Campbell Grobman Films', 'origin_country': 'US'}, {'id': 1632, 'logo_path': '/cisLn1YAUuptXVBa0xjq7ST9cH0.png', 'name': 'Lionsgate', 'origin_country': 'US'}] |
|---|---|
| 2nd row | [{'id': 1423, 'logo_path': '/1rbAwGQzrNvXDICD6HWEn1YqfAV.png', 'name': 'Escape Artists', 'origin_country': 'US'}, {'id': 5, 'logo_path': '/wrweLpBqRYcAM7kCSaHDJRxKGOP.png', 'name': 'Columbia Pictures', 'origin_country': 'US'}, {'id': 10400, 'logo_path': '/9LlB2YAwXTkUAhx0pItSo6pDlkB.png', 'name': 'Eagle Pictures', 'origin_country': 'IT'}, {'id': 44967, 'logo_path': None, 'name': 'ZHIV Productions', 'origin_country': ''}] |
| 3rd row | [{'id': 2785, 'logo_path': '/l5zW8jjmQOCx2dFmvnmbYmqoBmL.png', 'name': 'Warner Bros. Animation', 'origin_country': 'US'}] |
| 4th row | [{'id': 4, 'logo_path': '/gz66EfNoYPqHTYI4q9UEN4CbHRc.png', 'name': 'Paramount', 'origin_country': 'US'}, {'id': 82819, 'logo_path': '/gXfFl9pRPaoaq14jybEn1pHeldr.png', 'name': 'Skydance', 'origin_country': 'US'}, {'id': 21777, 'logo_path': None, 'name': 'TC Productions', 'origin_country': 'US'}] |
| 5th row | [{'id': 204005, 'logo_path': None, 'name': 'Rock & Ruz', 'origin_country': 'ES'}] |
| Value | Count | Frequency (%) |
| id | 31298 | 10.7% |
| logo_path | 31288 | 10.7% |
| name | 31288 | 10.7% |
| origin_country | 31288 | 10.7% |
| us | 14093 | 4.8% |
| none | 12436 | 4.3% |
| 6894 | 2.4% | |
| pictures | 4744 | 1.6% |
| films | 3648 | 1.3% |
| productions | 3537 | 1.2% |
| Other values (21950) | 120954 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 413056 | 12.7% |
| 281472 | 8.7% | |
| o | 178504 | 5.5% |
| n | 171817 | 5.3% |
| i | 143373 | 4.4% |
| : | 125161 | 3.9% |
| , | 115615 | 3.6% |
| t | 107883 | 3.3% |
| r | 105135 | 3.2% |
| a | 102782 | 3.2% |
| Other values (195) | 1503065 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1553183 | |
| Other Punctuation | 694096 | |
| Uppercase Letter | 355938 | 11.0% |
| Space Separator | 281472 | 8.7% |
| Decimal Number | 216171 | 6.7% |
| Connector Punctuation | 62579 | 1.9% |
| Close Punctuation | 41476 | 1.3% |
| Open Punctuation | 41476 | 1.3% |
| Dash Punctuation | 1007 | < 0.1% |
| Math Symbol | 297 | < 0.1% |
| Other values (6) | 168 | < 0.1% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| 有 | 8 | 5.6% |
| 公 | 8 | 5.6% |
| 司 | 8 | 5.6% |
| 限 | 8 | 5.6% |
| 文 | 5 | 3.5% |
| 化 | 5 | 3.5% |
| 影 | 5 | 3.5% |
| 上 | 4 | 2.8% |
| 海 | 3 | 2.1% |
| 주 | 3 | 2.1% |
| Other values (71) | 86 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 178504 | |
| n | 171817 | 11.1% |
| i | 143373 | 9.2% |
| t | 107883 | 6.9% |
| r | 105135 | 6.8% |
| a | 102782 | 6.6% |
| g | 93497 | 6.0% |
| e | 93045 | 6.0% |
| p | 63197 | 4.1% |
| u | 58286 | 3.8% |
| Other values (41) | 435664 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 28655 | 8.1% |
| U | 23570 | 6.6% |
| N | 22843 | 6.4% |
| P | 22036 | 6.2% |
| F | 17544 | 4.9% |
| E | 16361 | 4.6% |
| C | 16186 | 4.5% |
| A | 14254 | 4.0% |
| R | 14248 | 4.0% |
| B | 12993 | 3.7% |
| Other values (23) | 167248 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 413056 | |
| : | 125161 | 18.0% |
| , | 115615 | 16.7% |
| . | 20518 | 3.0% |
| / | 19164 | 2.8% |
| & | 333 | < 0.1% |
| " | 226 | < 0.1% |
| ! | 16 | < 0.1% |
| @ | 4 | < 0.1% |
| 、 | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 30685 | |
| 2 | 23949 | |
| 4 | 21343 | |
| 3 | 21150 | |
| 5 | 20894 | |
| 9 | 20498 | |
| 8 | 20052 | |
| 0 | 19663 | |
| 6 | 19096 | |
| 7 | 18841 |
Close Punctuation
| Value | Count | Frequency (%) |
| } | 31288 | |
| ] | 10000 | 24.1% |
| ) | 185 | 0.4% |
| ) | 3 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| { | 31288 | |
| [ | 10000 | 24.1% |
| ( | 185 | 0.4% |
| ( | 3 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1006 | |
| – | 1 | 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ℃ | 13 | |
| ㈜ | 1 | 7.1% |
Space Separator
| Value | Count | Frequency (%) |
| 281472 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 62579 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 297 |
Other Number
| Value | Count | Frequency (%) |
| ² | 5 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 3 |
Modifier Letter
| Value | Count | Frequency (%) |
| ー | 2 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1909121 | |
| Common | 1338597 | |
| Han | 105 | < 0.1% |
| Hangul | 31 | < 0.1% |
| Katakana | 8 | < 0.1% |
| Inherited | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 178504 | 9.4% |
| n | 171817 | 9.0% |
| i | 143373 | 7.5% |
| t | 107883 | 5.7% |
| r | 105135 | 5.5% |
| a | 102782 | 5.4% |
| g | 93497 | 4.9% |
| e | 93045 | 4.9% |
| p | 63197 | 3.3% |
| u | 58286 | 3.1% |
| Other values (74) | 791602 |
Han
| Value | Count | Frequency (%) |
| 有 | 8 | 7.6% |
| 公 | 8 | 7.6% |
| 司 | 8 | 7.6% |
| 限 | 8 | 7.6% |
| 文 | 5 | 4.8% |
| 化 | 5 | 4.8% |
| 影 | 5 | 4.8% |
| 上 | 4 | 3.8% |
| 海 | 3 | 2.9% |
| 业 | 3 | 2.9% |
| Other values (40) | 48 |
Common
| Value | Count | Frequency (%) |
| ' | 413056 | |
| 281472 | ||
| : | 125161 | 9.4% |
| , | 115615 | 8.6% |
| _ | 62579 | 4.7% |
| } | 31288 | 2.3% |
| { | 31288 | 2.3% |
| 1 | 30685 | 2.3% |
| 2 | 23949 | 1.8% |
| 4 | 21343 | 1.6% |
| Other values (28) | 202161 |
Hangul
| Value | Count | Frequency (%) |
| 주 | 3 | 9.7% |
| 을 | 2 | 6.5% |
| 사 | 2 | 6.5% |
| 가 | 2 | 6.5% |
| 화 | 2 | 6.5% |
| 영 | 2 | 6.5% |
| 룬 | 1 | 3.2% |
| 마 | 1 | 3.2% |
| 네 | 1 | 3.2% |
| 시 | 1 | 3.2% |
| Other values (14) | 14 |
Katakana
| Value | Count | Frequency (%) |
| ソ | 1 | |
| ッ | 1 | |
| リ | 1 | |
| ド | 1 | |
| フ | 1 | |
| ィ | 1 | |
| ャ | 1 | |
| チ | 1 |
Inherited
| Value | Count | Frequency (%) |
| ́ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3246654 | |
| None | 1046 | < 0.1% |
| CJK | 105 | < 0.1% |
| Hangul | 30 | < 0.1% |
| Letterlike Symbols | 13 | < 0.1% |
| Katakana | 10 | < 0.1% |
| Punctuation | 4 | < 0.1% |
| Diacriticals | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 413056 | 12.7% |
| 281472 | 8.7% | |
| o | 178504 | 5.5% |
| n | 171817 | 5.3% |
| i | 143373 | 4.4% |
| : | 125161 | 3.9% |
| , | 115615 | 3.6% |
| t | 107883 | 3.3% |
| r | 105135 | 3.2% |
| a | 102782 | 3.2% |
| Other values (72) | 1501856 |
None
| Value | Count | Frequency (%) |
| é | 602 | |
| í | 51 | 4.9% |
| ä | 48 | 4.6% |
| ñ | 44 | 4.2% |
| É | 38 | 3.6% |
| á | 35 | 3.3% |
| ó | 34 | 3.3% |
| ö | 31 | 3.0% |
| ü | 18 | 1.7% |
| ç | 18 | 1.7% |
| Other values (27) | 127 | 12.1% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ℃ | 13 |
CJK
| Value | Count | Frequency (%) |
| 有 | 8 | 7.6% |
| 公 | 8 | 7.6% |
| 司 | 8 | 7.6% |
| 限 | 8 | 7.6% |
| 文 | 5 | 4.8% |
| 化 | 5 | 4.8% |
| 影 | 5 | 4.8% |
| 上 | 4 | 3.8% |
| 海 | 3 | 2.9% |
| 业 | 3 | 2.9% |
| Other values (40) | 48 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 3 | |
| – | 1 | 25.0% |
Hangul
| Value | Count | Frequency (%) |
| 주 | 3 | 10.0% |
| 을 | 2 | 6.7% |
| 사 | 2 | 6.7% |
| 가 | 2 | 6.7% |
| 화 | 2 | 6.7% |
| 영 | 2 | 6.7% |
| 룬 | 1 | 3.3% |
| 마 | 1 | 3.3% |
| 네 | 1 | 3.3% |
| 시 | 1 | 3.3% |
| Other values (13) | 13 |
Katakana
| Value | Count | Frequency (%) |
| ー | 2 | |
| ソ | 1 | |
| ッ | 1 | |
| リ | 1 | |
| ド | 1 | |
| フ | 1 | |
| ィ | 1 | |
| ャ | 1 | |
| チ | 1 |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 1 |
| Distinct | 868 |
|---|---|
| Distinct (%) | 8.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.2 KiB |
Length
| Max length | 517 |
|---|---|
| Median length | 433 |
| Mean length | 69.0281 |
| Min length | 2 |
Characters and Unicode
| Total characters | 690281 |
|---|---|
| Distinct characters | 63 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 613 ? |
|---|---|
| Unique (%) | 6.1% |
Sample
| 1st row | [{'iso_3166_1': 'US', 'name': 'United States of America'}] |
|---|---|
| 2nd row | [{'iso_3166_1': 'IT', 'name': 'Italy'}, {'iso_3166_1': 'US', 'name': 'United States of America'}] |
| 3rd row | [{'iso_3166_1': 'US', 'name': 'United States of America'}] |
| 4th row | [{'iso_3166_1': 'US', 'name': 'United States of America'}] |
| 5th row | [{'iso_3166_1': 'ES', 'name': 'Spain'}] |
| Value | Count | Frequency (%) |
| iso_3166_1 | 13839 | |
| name | 13839 | |
| united | 7991 | |
| states | 6750 | |
| of | 6750 | |
| america | 6750 | |
| us | 6750 | |
| gb | 1222 | 1.6% |
| kingdom | 1222 | 1.6% |
| france | 758 | 1.0% |
| Other values (220) | 11956 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 110712 | |
| 67827 | 9.8% | |
| e | 38199 | 5.5% |
| a | 34681 | 5.0% |
| i | 31757 | 4.6% |
| 6 | 27678 | 4.0% |
| _ | 27678 | 4.0% |
| : | 27678 | 4.0% |
| 1 | 27678 | 4.0% |
| n | 27633 | 4.0% |
| Other values (53) | 268760 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 264597 | |
| Other Punctuation | 156247 | |
| Decimal Number | 69195 | 10.0% |
| Space Separator | 67827 | 9.8% |
| Uppercase Letter | 57059 | 8.3% |
| Connector Punctuation | 27678 | 4.0% |
| Close Punctuation | 23839 | 3.5% |
| Open Punctuation | 23839 | 3.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 15105 | |
| S | 14727 | |
| A | 8012 | |
| K | 2529 | 4.4% |
| C | 1863 | 3.3% |
| G | 1787 | 3.1% |
| B | 1656 | 2.9% |
| F | 1568 | 2.7% |
| R | 1476 | 2.6% |
| J | 1448 | 2.5% |
| Other values (16) | 6888 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 38199 | |
| a | 34681 | |
| i | 31757 | |
| n | 27633 | |
| o | 23538 | |
| t | 22642 | |
| m | 22606 | |
| s | 21117 | |
| d | 10373 | 3.9% |
| r | 9279 | 3.5% |
| Other values (15) | 22772 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 110712 | |
| : | 27678 | 17.7% |
| , | 17857 | 11.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 27678 | |
| 1 | 27678 | |
| 3 | 13839 |
Close Punctuation
| Value | Count | Frequency (%) |
| } | 13839 | |
| ] | 10000 |
Open Punctuation
| Value | Count | Frequency (%) |
| { | 13839 | |
| [ | 10000 |
Space Separator
| Value | Count | Frequency (%) |
| 67827 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 27678 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 368625 | |
| Latin | 321656 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 38199 | |
| a | 34681 | |
| i | 31757 | |
| n | 27633 | 8.6% |
| o | 23538 | 7.3% |
| t | 22642 | 7.0% |
| m | 22606 | 7.0% |
| s | 21117 | 6.6% |
| U | 15105 | 4.7% |
| S | 14727 | 4.6% |
| Other values (41) | 69651 |
Common
| Value | Count | Frequency (%) |
| ' | 110712 | |
| 67827 | ||
| 6 | 27678 | 7.5% |
| _ | 27678 | 7.5% |
| : | 27678 | 7.5% |
| 1 | 27678 | 7.5% |
| , | 17857 | 4.8% |
| 3 | 13839 | 3.8% |
| } | 13839 | 3.8% |
| { | 13839 | 3.8% |
| Other values (2) | 20000 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 690281 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 110712 | |
| 67827 | 9.8% | |
| e | 38199 | 5.5% |
| a | 34681 | 5.0% |
| i | 31757 | 4.6% |
| 6 | 27678 | 4.0% |
| _ | 27678 | 4.0% |
| : | 27678 | 4.0% |
| 1 | 27678 | 4.0% |
| n | 27633 | 4.0% |
| Other values (53) | 268760 |
SpokenLanguages
Text
| Distinct | 974 |
|---|---|
| Distinct (%) | 9.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.2 KiB |
Length
| Max length | 725 |
|---|---|
| Median length | 67 |
| Mean length | 94.5997 |
| Min length | 2 |
Characters and Unicode
| Total characters | 945997 |
|---|---|
| Distinct characters | 198 |
| Distinct categories | 12 ? |
| Distinct scripts | 16 ? |
| Distinct blocks | 16 ? |
Unique
| Unique | 740 ? |
|---|---|
| Unique (%) | 7.4% |
Sample
| 1st row | [{'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}] |
|---|---|
| 2nd row | [{'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}, {'english_name': 'Italian', 'iso_639_1': 'it', 'name': 'Italiano'}] |
| 3rd row | [{'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}] |
| 4th row | [{'english_name': 'French', 'iso_639_1': 'fr', 'name': 'Français'}, {'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}, {'english_name': 'Italian', 'iso_639_1': 'it', 'name': 'Italiano'}, {'english_name': 'Russian', 'iso_639_1': 'ru', 'name': 'Pусский'}] |
| 5th row | [{'english_name': 'Spanish', 'iso_639_1': 'es', 'name': 'Español'}] |
| Value | Count | Frequency (%) |
| english | 15714 | |
| english_name | 14176 | |
| iso_639_1 | 14176 | |
| name | 14176 | |
| en | 7857 | |
| spanish | 895 | 1.0% |
| español | 895 | 1.0% |
| es | 895 | 1.0% |
| français | 851 | 1.0% |
| french | 851 | 1.0% |
| Other values (277) | 15188 |
Most occurring characters
| Value | Count | Frequency (%) |
| ' | 170112 | |
| 75674 | 8.0% | |
| n | 74418 | 7.9% |
| e | 57384 | 6.1% |
| s | 51181 | 5.4% |
| i | 49770 | 5.3% |
| : | 42528 | 4.5% |
| _ | 42528 | 4.5% |
| a | 40466 | 4.3% |
| h | 33241 | 3.5% |
| Other values (188) | 308695 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 441401 | |
| Other Punctuation | 245822 | |
| Space Separator | 75674 | 8.0% |
| Decimal Number | 56704 | 6.0% |
| Connector Punctuation | 42528 | 4.5% |
| Uppercase Letter | 25980 | 2.7% |
| Close Punctuation | 24176 | 2.6% |
| Open Punctuation | 24176 | 2.6% |
| Other Letter | 9047 | 1.0% |
| Nonspacing Mark | 254 | < 0.1% |
| Other values (2) | 235 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 74418 | |
| e | 57384 | |
| s | 51181 | |
| i | 49770 | |
| a | 40466 | |
| h | 33241 | |
| l | 32548 | |
| g | 30645 | |
| m | 29009 | 6.6% |
| o | 17466 | 4.0% |
| Other values (67) | 25273 | 5.7% |
Other Letter
| Value | Count | Frequency (%) |
| 語 | 801 | 8.9% |
| 日 | 801 | 8.9% |
| 本 | 801 | 8.9% |
| 话 | 566 | 6.3% |
| 州 | 438 | 4.8% |
| 국 | 366 | 4.0% |
| 조 | 366 | 4.0% |
| 말 | 366 | 4.0% |
| 한 | 366 | 4.0% |
| 선 | 366 | 4.0% |
| Other values (49) | 3810 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 16635 | |
| F | 1722 | 6.6% |
| S | 1031 | 4.0% |
| I | 1022 | 3.9% |
| J | 803 | 3.1% |
| P | 765 | 2.9% |
| D | 632 | 2.4% |
| G | 558 | 2.1% |
| M | 410 | 1.6% |
| K | 385 | 1.5% |
| Other values (19) | 2017 | 7.8% |
Spacing Mark
| Value | Count | Frequency (%) |
| ि | 82 | |
| ी | 82 | |
| ు | 24 | 10.3% |
| া | 14 | 6.0% |
| ੀ | 7 | 3.0% |
| ং | 7 | 3.0% |
| ி | 7 | 3.0% |
| ਾ | 7 | 3.0% |
| ං | 2 | 0.9% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ִ | 96 | |
| ् | 82 | |
| ְ | 48 | |
| ె | 12 | 4.7% |
| ் | 7 | 2.8% |
| ੰ | 7 | 2.8% |
| ි | 2 | 0.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 170112 | |
| : | 42528 | 17.3% |
| , | 32581 | 13.3% |
| / | 585 | 0.2% |
| ? | 10 | < 0.1% |
| ; | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 14176 | |
| 1 | 14176 | |
| 3 | 14176 | |
| 6 | 14176 |
Close Punctuation
| Value | Count | Frequency (%) |
| } | 14176 | |
| ] | 10000 |
Open Punctuation
| Value | Count | Frequency (%) |
| { | 14176 | |
| [ | 10000 |
Space Separator
| Value | Count | Frequency (%) |
| 75674 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 42528 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 469083 | |
| Latin | 464699 | |
| Han | 4758 | 0.5% |
| Cyrillic | 2299 | 0.2% |
| Hangul | 2196 | 0.2% |
| Arabic | 1061 | 0.1% |
| Devanagari | 492 | 0.1% |
| Thai | 448 | < 0.1% |
| Hebrew | 384 | < 0.1% |
| Greek | 320 | < 0.1% |
| Other values (6) | 257 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 74418 | |
| e | 57384 | |
| s | 51181 | |
| i | 49770 | |
| a | 40466 | |
| h | 33241 | |
| l | 32548 | |
| g | 30645 | |
| m | 29009 | 6.2% |
| o | 17466 | 3.8% |
| Other values (60) | 48571 |
Cyrillic
| Value | Count | Frequency (%) |
| с | 669 | |
| к | 383 | |
| и | 360 | |
| й | 340 | |
| у | 319 | |
| а | 37 | 1.6% |
| р | 33 | 1.4% |
| н | 22 | 1.0% |
| ь | 22 | 1.0% |
| У | 22 | 1.0% |
| Other values (12) | 92 | 4.0% |
Common
| Value | Count | Frequency (%) |
| ' | 170112 | |
| 75674 | ||
| : | 42528 | 9.1% |
| _ | 42528 | 9.1% |
| , | 32581 | 6.9% |
| 9 | 14176 | 3.0% |
| } | 14176 | 3.0% |
| 1 | 14176 | 3.0% |
| { | 14176 | 3.0% |
| 3 | 14176 | 3.0% |
| Other values (7) | 34780 | 7.4% |
Arabic
| Value | Count | Frequency (%) |
| ا | 162 | |
| ر | 162 | |
| ة | 129 | |
| ي | 129 | |
| ع | 129 | |
| ب | 129 | |
| ل | 129 | |
| س | 18 | 1.7% |
| ی | 18 | 1.7% |
| ف | 18 | 1.7% |
| Other values (5) | 38 | 3.6% |
Han
| Value | Count | Frequency (%) |
| 語 | 801 | |
| 日 | 801 | |
| 本 | 801 | |
| 话 | 566 | |
| 州 | 438 | |
| 通 | 347 | |
| 普 | 347 | |
| 話 | 219 | 4.6% |
| 廣 | 219 | 4.6% |
| 广 | 219 | 4.6% |
Hebrew
| Value | Count | Frequency (%) |
| ִ | 96 | |
| ר | 48 | |
| ת | 48 | |
| י | 48 | |
| ע | 48 | |
| ְ | 48 | |
| ב | 48 |
Greek
| Value | Count | Frequency (%) |
| λ | 80 | |
| κ | 40 | |
| ι | 40 | |
| ν | 40 | |
| η | 40 | |
| ά | 40 | |
| ε | 40 |
Georgian
| Value | Count | Frequency (%) |
| ქ | 9 | |
| ლ | 9 | |
| უ | 9 | |
| ა | 9 | |
| თ | 9 | |
| ი | 9 | |
| რ | 9 |
Hangul
| Value | Count | Frequency (%) |
| 국 | 366 | |
| 조 | 366 | |
| 말 | 366 | |
| 한 | 366 | |
| 선 | 366 | |
| 어 | 366 |
Thai
| Value | Count | Frequency (%) |
| า | 128 | |
| ท | 64 | |
| ย | 64 | |
| ไ | 64 | |
| ษ | 64 | |
| ภ | 64 |
Devanagari
| Value | Count | Frequency (%) |
| ि | 82 | |
| द | 82 | |
| ् | 82 | |
| न | 82 | |
| ी | 82 | |
| ह | 82 |
Gurmukhi
| Value | Count | Frequency (%) |
| ਬ | 7 | |
| ਪ | 7 | |
| ੰ | 7 | |
| ੀ | 7 | |
| ਾ | 7 | |
| ਜ | 7 |
Telugu
| Value | Count | Frequency (%) |
| ు | 24 | |
| ల | 12 | |
| గ | 12 | |
| ె | 12 | |
| త | 12 |
Tamil
| Value | Count | Frequency (%) |
| ் | 7 | |
| ழ | 7 | |
| ி | 7 | |
| ம | 7 | |
| த | 7 |
Sinhala
| Value | Count | Frequency (%) |
| හ | 2 | |
| ල | 2 | |
| ං | 2 | |
| ි | 2 | |
| ස | 2 |
Bengali
| Value | Count | Frequency (%) |
| া | 14 | |
| ং | 7 | |
| ল | 7 | |
| ব | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 931602 | |
| CJK | 4758 | 0.5% |
| None | 2434 | 0.3% |
| Cyrillic | 2299 | 0.2% |
| Hangul | 2196 | 0.2% |
| Arabic | 1061 | 0.1% |
| Devanagari | 492 | 0.1% |
| Thai | 448 | < 0.1% |
| Hebrew | 384 | < 0.1% |
| Telugu | 72 | < 0.1% |
| Other values (6) | 251 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| ' | 170112 | |
| 75674 | 8.1% | |
| n | 74418 | 8.0% |
| e | 57384 | 6.2% |
| s | 51181 | 5.5% |
| i | 49770 | 5.3% |
| : | 42528 | 4.6% |
| _ | 42528 | 4.6% |
| a | 40466 | 4.3% |
| h | 33241 | 3.6% |
| Other values (58) | 294300 |
None
| Value | Count | Frequency (%) |
| ñ | 895 | |
| ç | 890 | |
| ê | 146 | 6.0% |
| λ | 80 | 3.3% |
| κ | 40 | 1.6% |
| ι | 40 | 1.6% |
| ν | 40 | 1.6% |
| η | 40 | 1.6% |
| ά | 40 | 1.6% |
| ε | 40 | 1.6% |
| Other values (14) | 183 | 7.5% |
CJK
| Value | Count | Frequency (%) |
| 語 | 801 | |
| 日 | 801 | |
| 本 | 801 | |
| 话 | 566 | |
| 州 | 438 | |
| 通 | 347 | |
| 普 | 347 | |
| 話 | 219 | 4.6% |
| 廣 | 219 | 4.6% |
| 广 | 219 | 4.6% |
Cyrillic
| Value | Count | Frequency (%) |
| с | 669 | |
| к | 383 | |
| и | 360 | |
| й | 340 | |
| у | 319 | |
| а | 37 | 1.6% |
| р | 33 | 1.4% |
| н | 22 | 1.0% |
| ь | 22 | 1.0% |
| У | 22 | 1.0% |
| Other values (12) | 92 | 4.0% |
Hangul
| Value | Count | Frequency (%) |
| 국 | 366 | |
| 조 | 366 | |
| 말 | 366 | |
| 한 | 366 | |
| 선 | 366 | |
| 어 | 366 |
Arabic
| Value | Count | Frequency (%) |
| ا | 162 | |
| ر | 162 | |
| ة | 129 | |
| ي | 129 | |
| ع | 129 | |
| ب | 129 | |
| ل | 129 | |
| س | 18 | 1.7% |
| ی | 18 | 1.7% |
| ف | 18 | 1.7% |
| Other values (5) | 38 | 3.6% |
Thai
| Value | Count | Frequency (%) |
| า | 128 | |
| ท | 64 | |
| ย | 64 | |
| ไ | 64 | |
| ษ | 64 | |
| ภ | 64 |
Hebrew
| Value | Count | Frequency (%) |
| ִ | 96 | |
| ר | 48 | |
| ת | 48 | |
| י | 48 | |
| ע | 48 | |
| ְ | 48 | |
| ב | 48 |
Devanagari
| Value | Count | Frequency (%) |
| ि | 82 | |
| द | 82 | |
| ् | 82 | |
| न | 82 | |
| ी | 82 | |
| ह | 82 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ệ | 33 | |
| ế | 33 |
Telugu
| Value | Count | Frequency (%) |
| ు | 24 | |
| ల | 12 | |
| గ | 12 | |
| ె | 12 | |
| త | 12 |
Bengali
| Value | Count | Frequency (%) |
| া | 14 | |
| ং | 7 | |
| ল | 7 | |
| ব | 7 |
Georgian
| Value | Count | Frequency (%) |
| ქ | 9 | |
| ლ | 9 | |
| უ | 9 | |
| ა | 9 | |
| თ | 9 | |
| ი | 9 | |
| რ | 9 |
Gurmukhi
| Value | Count | Frequency (%) |
| ਬ | 7 | |
| ਪ | 7 | |
| ੰ | 7 | |
| ੀ | 7 | |
| ਾ | 7 | |
| ਜ | 7 |
Tamil
| Value | Count | Frequency (%) |
| ் | 7 | |
| ழ | 7 | |
| ி | 7 | |
| ம | 7 | |
| த | 7 |
Sinhala
| Value | Count | Frequency (%) |
| හ | 2 | |
| ල | 2 | |
| ං | 2 | |
| ි | 2 | |
| ස | 2 |
TagLine
Text
MISSING 
| Distinct | 7530 |
|---|---|
| Distinct (%) | 99.2% |
| Missing | 2413 |
| Missing (%) | 24.1% |
| Memory size | 156.2 KiB |
Length
| Max length | 206 |
|---|---|
| Median length | 143 |
| Mean length | 39.759457 |
| Min length | 3 |
Characters and Unicode
| Total characters | 301655 |
|---|---|
| Distinct characters | 100 |
| Distinct categories | 14 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 4 ? |
Unique
| Unique | 7479 ? |
|---|---|
| Unique (%) | 98.6% |
Sample
| 1st row | They'll die when they're dead. |
|---|---|
| 2nd row | Justice knows no borders. |
| 3rd row | Neon lights... Suits with shoulder pads... Jumping from explosions in slow motion... |
| 4th row | We all share the same fate. |
| 5th row | Attempting to survive in the middle of nowhere is her only option. |
| Value | Count | Frequency (%) |
| the | 3549 | 6.4% |
| a | 1988 | 3.6% |
| to | 1204 | 2.2% |
| of | 1165 | 2.1% |
| is | 1124 | 2.0% |
| you | 967 | 1.7% |
| in | 791 | 1.4% |
| and | 586 | 1.1% |
| for | 579 | 1.0% |
| one | 502 | 0.9% |
| Other values (6347) | 43096 |
Most occurring characters
| Value | Count | Frequency (%) |
| 47972 | ||
| e | 31978 | 10.6% |
| t | 18698 | 6.2% |
| o | 18615 | 6.2% |
| a | 16132 | 5.3% |
| n | 15291 | 5.1% |
| r | 14757 | 4.9% |
| i | 14757 | 4.9% |
| s | 14165 | 4.7% |
| h | 12027 | 4.0% |
| Other values (90) | 97263 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 222152 | |
| Space Separator | 47982 | 15.9% |
| Uppercase Letter | 16471 | 5.5% |
| Other Punctuation | 13766 | 4.6% |
| Decimal Number | 867 | 0.3% |
| Dash Punctuation | 259 | 0.1% |
| Final Punctuation | 115 | < 0.1% |
| Close Punctuation | 14 | < 0.1% |
| Open Punctuation | 14 | < 0.1% |
| Currency Symbol | 8 | < 0.1% |
| Other values (4) | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 31978 | |
| t | 18698 | 8.4% |
| o | 18615 | 8.4% |
| a | 16132 | 7.3% |
| n | 15291 | 6.9% |
| r | 14757 | 6.6% |
| i | 14757 | 6.6% |
| s | 14165 | 6.4% |
| h | 12027 | 5.4% |
| l | 9614 | 4.3% |
| Other values (24) | 56118 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2658 | |
| A | 1464 | 8.9% |
| S | 1187 | 7.2% |
| W | 974 | 5.9% |
| I | 957 | 5.8% |
| H | 942 | 5.7% |
| B | 806 | 4.9% |
| N | 753 | 4.6% |
| F | 745 | 4.5% |
| E | 724 | 4.4% |
| Other values (16) | 5261 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 9252 | |
| ' | 1727 | 12.5% |
| , | 1224 | 8.9% |
| ! | 886 | 6.4% |
| ? | 419 | 3.0% |
| … | 101 | 0.7% |
| " | 63 | 0.5% |
| : | 37 | 0.3% |
| % | 17 | 0.1% |
| * | 13 | 0.1% |
| Other values (4) | 27 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 254 | |
| 1 | 184 | |
| 2 | 103 | |
| 9 | 64 | 7.4% |
| 3 | 60 | 6.9% |
| 5 | 49 | 5.7% |
| 7 | 42 | 4.8% |
| 6 | 41 | 4.7% |
| 8 | 36 | 4.2% |
| 4 | 34 | 3.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 247 | |
| — | 8 | 3.1% |
| – | 4 | 1.5% |
Space Separator
| Value | Count | Frequency (%) |
| 47972 | ||
| 10 | < 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 113 | |
| ” | 2 | 1.7% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 2 | |
| ‘ | 1 |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 1 | |
| + | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 14 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 14 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 8 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 1 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 238623 | |
| Common | 63032 | 20.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 31978 | |
| t | 18698 | 7.8% |
| o | 18615 | 7.8% |
| a | 16132 | 6.8% |
| n | 15291 | 6.4% |
| r | 14757 | 6.2% |
| i | 14757 | 6.2% |
| s | 14165 | 5.9% |
| h | 12027 | 5.0% |
| l | 9614 | 4.0% |
| Other values (50) | 72589 |
Common
| Value | Count | Frequency (%) |
| 47972 | ||
| . | 9252 | 14.7% |
| ' | 1727 | 2.7% |
| , | 1224 | 1.9% |
| ! | 886 | 1.4% |
| ? | 419 | 0.7% |
| 0 | 254 | 0.4% |
| - | 247 | 0.4% |
| 1 | 184 | 0.3% |
| ’ | 113 | 0.2% |
| Other values (30) | 754 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 301398 | |
| Punctuation | 231 | 0.1% |
| None | 25 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 47972 | ||
| e | 31978 | 10.6% |
| t | 18698 | 6.2% |
| o | 18615 | 6.2% |
| a | 16132 | 5.4% |
| n | 15291 | 5.1% |
| r | 14757 | 4.9% |
| i | 14757 | 4.9% |
| s | 14165 | 4.7% |
| h | 12027 | 4.0% |
| Other values (72) | 97006 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 113 | |
| … | 101 | |
| — | 8 | 3.5% |
| – | 4 | 1.7% |
| ” | 2 | 0.9% |
| “ | 2 | 0.9% |
| ‘ | 1 | 0.4% |
None
| Value | Count | Frequency (%) |
| 10 | ||
| é | 4 | 16.0% |
| ñ | 2 | 8.0% |
| ü | 2 | 8.0% |
| á | 2 | 8.0% |
| ō | 1 | 4.0% |
| ê | 1 | 4.0% |
| ù | 1 | 4.0% |
| í | 1 | 4.0% |
| ½ | 1 | 4.0% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 1 |
RunTime
Real number (ℝ)
ZEROS 
| Distinct | 220 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 101.7372 |
| Minimum | 0 |
|---|---|
| Maximum | 400 |
| Zeros | 137 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 63 |
| Q1 | 91 |
| median | 101 |
| Q3 | 115 |
| 95-th percentile | 140 |
| Maximum | 400 |
| Range | 400 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 27.703785 |
|---|---|
| Coefficient of variation (CV) | 0.27230732 |
| Kurtosis | 6.3812347 |
| Mean | 101.7372 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -0.41509431 |
| Sum | 1017372 |
| Variance | 767.49969 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90 | 295 | 2.9% |
| 95 | 283 | 2.8% |
| 100 | 275 | 2.8% |
| 93 | 269 | 2.7% |
| 105 | 245 | 2.5% |
| 97 | 240 | 2.4% |
| 98 | 238 | 2.4% |
| 94 | 228 | 2.3% |
| 92 | 227 | 2.3% |
| 96 | 226 | 2.3% |
| Other values (210) | 7474 |
| Value | Count | Frequency (%) |
| 0 | 137 | |
| 2 | 3 | < 0.1% |
| 3 | 8 | 0.1% |
| 4 | 6 | 0.1% |
| 5 | 9 | 0.1% |
| 6 | 14 | 0.1% |
| 7 | 10 | 0.1% |
| 8 | 8 | 0.1% |
| 9 | 8 | 0.1% |
| 10 | 13 | 0.1% |
| Value | Count | Frequency (%) |
| 400 | 1 | |
| 333 | 1 | |
| 317 | 1 | |
| 254 | 1 | |
| 248 | 1 | |
| 247 | 1 | |
| 242 | 2 | |
| 240 | 1 | |
| 238 | 2 | |
| 237 | 1 |
Revenue
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 5579 |
|---|---|
| Distinct (%) | 55.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 62253720 |
| Minimum | 0 |
|---|---|
| Maximum | 2.923706 × 109 |
| Zeros | 4155 |
| Zeros (%) | 41.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 3720364 |
| Q3 | 54054618 |
| 95-th percentile | 3.0726891 × 108 |
| Maximum | 2.923706 × 109 |
| Range | 2.923706 × 109 |
| Interquartile range (IQR) | 54054618 |
Descriptive statistics
| Standard deviation | 1.5623949 × 108 |
|---|---|
| Coefficient of variation (CV) | 2.5097214 |
| Kurtosis | 55.356164 |
| Mean | 62253720 |
| Median Absolute Deviation (MAD) | 3720364 |
| Skewness | 5.8965705 |
| Sum | 6.225372 × 1011 |
| Variance | 2.4410779 × 1016 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4155 | |
| 11000000 | 11 | 0.1% |
| 2000000 | 10 | 0.1% |
| 10000000 | 10 | 0.1% |
| 12000000 | 10 | 0.1% |
| 30000000 | 9 | 0.1% |
| 7000000 | 8 | 0.1% |
| 25000000 | 8 | 0.1% |
| 5000000 | 7 | 0.1% |
| 8000000 | 7 | 0.1% |
| Other values (5569) | 5765 |
| Value | Count | Frequency (%) |
| 0 | 4155 | |
| 1 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 94 | 1 | < 0.1% |
| 126 | 1 | < 0.1% |
| 201 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2923706026 | 1 | |
| 2800000000 | 1 | |
| 2320250281 | 1 | |
| 2264162353 | 1 | |
| 2068223624 | 1 | |
| 2052415039 | 1 | |
| 1921847111 | 1 | |
| 1671537444 | 1 | |
| 1663075401 | 1 | |
| 1518815515 | 1 |
| Id | Popularity | VoteAverage | VoteCount | Budget | RunTime | Revenue | OriginalLanguage | |
|---|---|---|---|---|---|---|---|---|
| Id | 1.000 | 0.082 | -0.135 | -0.510 | -0.456 | -0.212 | -0.495 | 0.147 |
| Popularity | 0.082 | 1.000 | 0.161 | 0.343 | 0.245 | 0.078 | 0.278 | 0.000 |
| VoteAverage | -0.135 | 0.161 | 1.000 | 0.322 | 0.063 | 0.319 | 0.163 | 0.117 |
| VoteCount | -0.510 | 0.343 | 0.322 | 1.000 | 0.697 | 0.365 | 0.747 | 0.000 |
| Budget | -0.456 | 0.245 | 0.063 | 0.697 | 1.000 | 0.373 | 0.794 | 0.017 |
| RunTime | -0.212 | 0.078 | 0.319 | 0.365 | 0.373 | 1.000 | 0.387 | 0.123 |
| Revenue | -0.495 | 0.278 | 0.163 | 0.747 | 0.794 | 0.387 | 1.000 | 0.000 |
| OriginalLanguage | 0.147 | 0.000 | 0.117 | 0.000 | 0.017 | 0.123 | 0.000 | 1.000 |
| GenreIds | Id | OriginalLanguage | OriginalTitle | Overview | Popularity | ReleaseDate | Title | VoteAverage | VoteCount | Budget | ProductionCompanies | ProductionCountries | SpokenLanguages | TagLine | RunTime | Revenue | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | [28, 12, 53] | 299054 | en | Expend4bles | Armed with every weapon they can get their hands on and the skills to use them, The Expendables are the world’s last line of defense and the team that gets called when all other options are off the table. But new team members with new styles and tactics are going to give “new blood” a whole new meaning. | 3741.062 | 2023-09-15 | Expend4bles | 6.4 | 364 | 100000000 | [{'id': 1020, 'logo_path': '/kuUIHNwMec4dwOLghDhhZJzHZTd.png', 'name': 'Millennium Media', 'origin_country': 'US'}, {'id': 48738, 'logo_path': None, 'name': 'Campbell Grobman Films', 'origin_country': 'US'}, {'id': 1632, 'logo_path': '/cisLn1YAUuptXVBa0xjq7ST9cH0.png', 'name': 'Lionsgate', 'origin_country': 'US'}] | [{'iso_3166_1': 'US', 'name': 'United States of America'}] | [{'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}] | They'll die when they're dead. | 103 | 30000000 |
| 1 | [28, 53, 80] | 926393 | en | The Equalizer 3 | Robert McCall finds himself at home in Southern Italy but he discovers his friends are under the control of local crime bosses. As events turn deadly, McCall knows what he has to do: become his friends' protector by taking on the mafia. | 2471.515 | 2023-08-30 | The Equalizer 3 | 7.3 | 1027 | 70000000 | [{'id': 1423, 'logo_path': '/1rbAwGQzrNvXDICD6HWEn1YqfAV.png', 'name': 'Escape Artists', 'origin_country': 'US'}, {'id': 5, 'logo_path': '/wrweLpBqRYcAM7kCSaHDJRxKGOP.png', 'name': 'Columbia Pictures', 'origin_country': 'US'}, {'id': 10400, 'logo_path': '/9LlB2YAwXTkUAhx0pItSo6pDlkB.png', 'name': 'Eagle Pictures', 'origin_country': 'IT'}, {'id': 44967, 'logo_path': None, 'name': 'ZHIV Productions', 'origin_country': ''}] | [{'iso_3166_1': 'IT', 'name': 'Italy'}, {'iso_3166_1': 'US', 'name': 'United States of America'}] | [{'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}, {'english_name': 'Italian', 'iso_639_1': 'it', 'name': 'Italiano'}] | Justice knows no borders. | 109 | 176933602 |
| 2 | [16, 28, 14] | 1034062 | en | Mortal Kombat Legends: Cage Match | In 1980s Hollywood, action star Johnny Cage is looking to become an A-list actor. But when his costar, Jennifer, goes missing from set, Johnny finds himself thrust into a world filled with shadows, danger, and deceit. As he embarks on a bloody journey, Johnny quickly discovers the City of Angels has more than a few devils in its midst. | 2223.430 | 2023-10-17 | Mortal Kombat Legends: Cage Match | 7.8 | 27 | 0 | [{'id': 2785, 'logo_path': '/l5zW8jjmQOCx2dFmvnmbYmqoBmL.png', 'name': 'Warner Bros. Animation', 'origin_country': 'US'}] | [{'iso_3166_1': 'US', 'name': 'United States of America'}] | [{'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}] | Neon lights... Suits with shoulder pads... Jumping from explosions in slow motion... | 80 | 0 |
| 3 | [28, 53] | 575264 | en | Mission: Impossible - Dead Reckoning Part One | Ethan Hunt and his IMF team embark on their most dangerous mission yet: To track down a terrifying new weapon that threatens all of humanity before it falls into the wrong hands. With control of the future and the world's fate at stake and dark forces from Ethan's past closing in, a deadly race around the globe begins. Confronted by a mysterious, all-powerful enemy, Ethan must consider that nothing can matter more than his mission—not even the lives of those he cares about most. | 2032.927 | 2023-07-08 | Mission: Impossible - Dead Reckoning Part One | 7.7 | 1799 | 291000000 | [{'id': 4, 'logo_path': '/gz66EfNoYPqHTYI4q9UEN4CbHRc.png', 'name': 'Paramount', 'origin_country': 'US'}, {'id': 82819, 'logo_path': '/gXfFl9pRPaoaq14jybEn1pHeldr.png', 'name': 'Skydance', 'origin_country': 'US'}, {'id': 21777, 'logo_path': None, 'name': 'TC Productions', 'origin_country': 'US'}] | [{'iso_3166_1': 'US', 'name': 'United States of America'}] | [{'english_name': 'French', 'iso_639_1': 'fr', 'name': 'Français'}, {'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}, {'english_name': 'Italian', 'iso_639_1': 'it', 'name': 'Italiano'}, {'english_name': 'Russian', 'iso_639_1': 'ru', 'name': 'Pусский'}] | We all share the same fate. | 164 | 567148955 |
| 4 | [53, 18] | 1151534 | es | Nowhere | A young pregnant woman named Mia escapes from a country at war by hiding in a maritime container aboard a cargo ship. After a violent storm, Mia gives birth to the child while lost at sea, where she must fight to survive. | 1627.678 | 2023-09-29 | Nowhere | 7.6 | 686 | 0 | [{'id': 204005, 'logo_path': None, 'name': 'Rock & Ruz', 'origin_country': 'ES'}] | [{'iso_3166_1': 'ES', 'name': 'Spain'}] | [{'english_name': 'Spanish', 'iso_639_1': 'es', 'name': 'Español'}] | Attempting to survive in the middle of nowhere is her only option. | 109 | 0 |
| 5 | [27, 9648, 53] | 968051 | en | The Nun II | In 1956 France, a priest is violently murdered, and Sister Irene begins to investigate. She once again comes face-to-face with a powerful evil. | 1594.559 | 2023-09-06 | The Nun II | 7.0 | 1086 | 38500000 | [{'id': 12, 'logo_path': '/mevhneWSqbjU22D1MXNd4H9x0r0.png', 'name': 'New Line Cinema', 'origin_country': 'US'}, {'id': 76907, 'logo_path': '/ygMQtjsKX7BZkCQhQZY82lgnCUO.png', 'name': 'Atomic Monster', 'origin_country': 'US'}, {'id': 11565, 'logo_path': None, 'name': 'The Safran Company', 'origin_country': 'US'}] | [{'iso_3166_1': 'US', 'name': 'United States of America'}] | [{'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}, {'english_name': 'French', 'iso_639_1': 'fr', 'name': 'Français'}] | Confess your sins. | 110 | 262010000 |
| 6 | [28, 80, 53] | 961268 | ko | 발레리나 | Grieving the loss of a best friend she couldn't protect, an ex-bodyguard sets out to fulfill her dear friend's last wish: sweet revenge. | 1521.075 | 2023-10-05 | Ballerina | 7.0 | 200 | 0 | [{'id': 127541, 'logo_path': '/Aq35mXuZv7lhPm8a60YKRaB9Vek.png', 'name': 'Climax Studios', 'origin_country': 'KR'}] | [{'iso_3166_1': 'KR', 'name': 'South Korea'}] | [{'english_name': 'Korean', 'iso_639_1': 'ko', 'name': '한국어/조선말'}] | Merciless and ruthless, to hell. | 93 | 0 |
| 7 | [27, 53] | 951491 | en | Saw X | Between the events of 'Saw' and 'Saw II', a sick and desperate John Kramer travels to Mexico for a risky and experimental medical procedure in hopes of a miracle cure for his cancer, only to discover the entire operation is a scam to defraud the most vulnerable. Armed with a newfound purpose, the infamous serial killer returns to his work, turning the tables on the con artists in his signature visceral way through devious, deranged, and ingenious traps. | 1469.177 | 2023-09-26 | Saw X | 7.3 | 287 | 13000000 | [{'id': 2061, 'logo_path': '/o9LbN33hRaj4qcebUv1bikyXeoB.png', 'name': 'Twisted Pictures', 'origin_country': 'US'}, {'id': 1632, 'logo_path': '/cisLn1YAUuptXVBa0xjq7ST9cH0.png', 'name': 'Lionsgate', 'origin_country': 'US'}] | [{'iso_3166_1': 'US', 'name': 'United States of America'}] | [{'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}, {'english_name': 'Spanish', 'iso_639_1': 'es', 'name': 'Español'}] | Witness the return of Jigsaw. | 118 | 71984243 |
| 8 | [12, 28, 18] | 980489 | en | Gran Turismo | The ultimate wish-fulfillment tale of a teenage Gran Turismo player whose gaming skills won him a series of Nissan competitions to become an actual professional racecar driver. | 1315.518 | 2023-08-09 | Gran Turismo | 8.1 | 1127 | 60000000 | [{'id': 125281, 'logo_path': '/3hV8pyxzAJgEjiSYVv1WZ0ZYayp.png', 'name': 'PlayStation Productions', 'origin_country': 'US'}, {'id': 84792, 'logo_path': '/7Rfr3Zu6QnHpXW2VdSEzUminAQd.png', 'name': '2.0 Entertainment', 'origin_country': 'US'}, {'id': 5, 'logo_path': '/wrweLpBqRYcAM7kCSaHDJRxKGOP.png', 'name': 'Columbia Pictures', 'origin_country': 'US'}] | [{'iso_3166_1': 'US', 'name': 'United States of America'}] | [{'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}, {'english_name': 'Japanese', 'iso_639_1': 'ja', 'name': '日本語'}, {'english_name': 'German', 'iso_639_1': 'de', 'name': 'Deutsch'}] | From gamer to racer. | 135 | 114800000 |
| 9 | [53, 878, 28] | 937249 | en | 57 Seconds | When a tech blogger lands an interview with a tech guru and stops an attack on him, he finds a mysterious ring that takes him back 57 seconds into the past. | 1304.978 | 2023-09-29 | 57 Seconds | 5.4 | 111 | 0 | [{'id': 189103, 'logo_path': '/hu0qcD4k7kfWpdAewqmJSUyZPa7.png', 'name': 'Ashland Hill Media Finance', 'origin_country': 'US'}, {'id': 176331, 'logo_path': None, 'name': 'BGG Capital', 'origin_country': ''}, {'id': 12029, 'logo_path': '/8PAf5K4VVI6xO9SjB7bxLtpi4xH.png', 'name': 'Curmudgeon Films', 'origin_country': 'US'}] | [{'iso_3166_1': 'US', 'name': 'United States of America'}] | [{'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}] | Rewind the past. Avenge the future. | 99 | 0 |
| GenreIds | Id | OriginalLanguage | OriginalTitle | Overview | Popularity | ReleaseDate | Title | VoteAverage | VoteCount | Budget | ProductionCompanies | ProductionCountries | SpokenLanguages | TagLine | RunTime | Revenue | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9990 | [9648, 80, 53] | 11092 | en | Presumed Innocent | Rusty Sabich is a deputy prosecutor engaged in an obsessive affair with a coworker who is murdered. Soon after, he's accused of the crime. And his fight to clear his name becomes a whirlpool of lies and hidden passions. | 13.054 | 1990-07-27 | Presumed Innocent | 6.8 | 595 | 22000000 | [{'id': 932, 'logo_path': None, 'name': 'Mirage Enterprises', 'origin_country': 'US'}, {'id': 174, 'logo_path': '/IuAlhI9eVC9Z8UQWOIDdWRKSEJ.png', 'name': 'Warner Bros. Pictures', 'origin_country': 'US'}] | [{'iso_3166_1': 'US', 'name': 'United States of America'}] | [{'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}] | Some people would kill for love | 127 | 221303188 |
| 9991 | [18, 10770, 36] | 664423 | en | The Windermere Children | The story of the pioneering project to rehabilitate child survivors of the Holocaust on the shores of Lake Windermere. | 13.052 | 2020-01-27 | The Windermere Children | 7.5 | 96 | 0 | [{'id': 16049, 'logo_path': '/2hb0R2GJ6BvgwmyqO7fIgtjsakt.png', 'name': 'Wall to Wall', 'origin_country': 'GB'}, {'id': 109759, 'logo_path': '/na8NZKqFc8Ep9qAg8KQUcH1DpQl.png', 'name': 'Warner Bros. International Television Production Germany', 'origin_country': 'DE'}, {'id': 4606, 'logo_path': '/otZHbf1HmzLRQsZFSqJXkf8EHz7.png', 'name': 'ZDF', 'origin_country': 'DE'}, {'id': 11667, 'logo_path': '/dhBJdstAolGmqRfbQfElhsU1cTo.png', 'name': 'Northern Ireland Screen', 'origin_country': 'GB'}] | [{'iso_3166_1': 'DE', 'name': 'Germany'}, {'iso_3166_1': 'GB', 'name': 'United Kingdom'}] | [{'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}, {'english_name': 'German', 'iso_639_1': 'de', 'name': 'Deutsch'}] | NaN | 88 | 0 |
| 9992 | [27, 878] | 3077 | en | Son of Frankenstein | One of the sons of late Dr. Henry Frankenstein finds his father's ghoulish creation in a coma and revives him, only to find out the monster is controlled by Ygor who is bent on revenge. | 13.052 | 1939-01-13 | Son of Frankenstein | 6.7 | 205 | 420000 | [{'id': 33, 'logo_path': '/8lvHyhjr8oUKOOy2dKXoALWKdp0.png', 'name': 'Universal Pictures', 'origin_country': 'US'}] | [{'iso_3166_1': 'US', 'name': 'United States of America'}] | [{'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}] | The black shadows of the past bred this half-man . . . half-demon ! . . . creating a new and terrible juggernaut of destruction ! | 99 | 0 |
| 9993 | [18, 10749] | 413543 | hi | Dear Zindagi | An unconventional thinker helps a budding cinematographer gain a new perspective on life. | 13.051 | 2016-11-23 | Dear Zindagi | 7.1 | 210 | 4300000 | [{'id': 2343, 'logo_path': '/fkrlAFxgAtIHgYJGIQcDggeucKV.png', 'name': 'Red Chillies Entertainment', 'origin_country': 'IN'}, {'id': 19146, 'logo_path': '/5Ff25ornzVNhm5skuAvMAR556NB.png', 'name': 'Dharma Productions', 'origin_country': 'IN'}, {'id': 78597, 'logo_path': None, 'name': 'Hope Productions', 'origin_country': 'IN'}] | [{'iso_3166_1': 'IN', 'name': 'India'}] | [{'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}, {'english_name': 'Hindi', 'iso_639_1': 'hi', 'name': 'हिन्दी'}] | NaN | 151 | 3376375 |
| 9994 | [12, 18, 28, 53] | 14400 | fr | Largo Winch | After a powerful billionaire is murdered, his secret adoptive son must race to prove his legitimacy, find his father's killers and stop them from taking over his financial empire. | 13.051 | 2008-12-17 | The Heir Apparent: Largo Winch | 6.0 | 486 | 25412760 | [{'id': 6750, 'logo_path': None, 'name': 'Pan-Européenne', 'origin_country': 'FR'}, {'id': 856, 'logo_path': '/3tfzS2CrX6Ntbu927XzHXEPDA6y.png', 'name': 'Wild Bunch', 'origin_country': 'FR'}, {'id': 356, 'logo_path': '/tSJvuFaLIp7l0ONLUiAHA61GbXu.png', 'name': 'TF1 Films Production', 'origin_country': 'FR'}, {'id': 139617, 'logo_path': None, 'name': 'Araneo', 'origin_country': 'BE'}, {'id': 104, 'logo_path': '/9aotxauvc9685tq9pTcRJszuT06.png', 'name': 'Canal+', 'origin_country': 'FR'}, {'id': 14362, 'logo_path': None, 'name': 'October Pictures', 'origin_country': 'HK'}] | [{'iso_3166_1': 'BE', 'name': 'Belgium'}, {'iso_3166_1': 'FR', 'name': 'France'}, {'iso_3166_1': 'HK', 'name': 'Hong Kong'}] | [{'english_name': 'French', 'iso_639_1': 'fr', 'name': 'Français'}, {'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}, {'english_name': 'Serbian', 'iso_639_1': 'sr', 'name': 'Srpski'}, {'english_name': 'Portuguese', 'iso_639_1': 'pt', 'name': 'Português'}] | NaN | 108 | 0 |
| 9995 | [28, 80, 53] | 2749 | en | 15 Minutes | When Eastern European criminals Oleg and Emil come to New York City to pick up their share of a heist score, Oleg steals a video camera and starts filming their activities, both legal and illegal. When they learn how the American media circus can make a remorseless killer look like the victim and make them rich, they target media-savvy NYPD Homicide Detective Eddie Flemming and media-naive FDNY Fire Marshal Jordy Warsaw, the cops investigating their murder and torching of their former criminal partner, filming everything to sell to the local tabloid TV show "Top Story." | 13.051 | 2001-03-01 | 15 Minutes | 5.9 | 646 | 60000000 | [{'id': 376, 'logo_path': None, 'name': 'Industry Entertainment', 'origin_country': ''}, {'id': 11391, 'logo_path': '/t6m0uRTzaFHCsvEpikENE0PWJGb.png', 'name': 'Tribeca Productions', 'origin_country': 'US'}, {'id': 67171, 'logo_path': None, 'name': 'New Redemption Pictures', 'origin_country': ''}] | [{'iso_3166_1': 'DE', 'name': 'Germany'}, {'iso_3166_1': 'US', 'name': 'United States of America'}] | [{'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}, {'english_name': 'Greek', 'iso_639_1': 'el', 'name': 'ελληνικά'}, {'english_name': 'Russian', 'iso_639_1': 'ru', 'name': 'Pусский'}, {'english_name': 'Czech', 'iso_639_1': 'cs', 'name': 'Český'}] | America Likes to Watch | 120 | 56359980 |
| 9996 | [18, 28, 53] | 11128 | en | Ladder 49 | Under the watchful eye of his mentor, Captain Mike Kennedy, probationary firefighter Jack Morrison matures into a seasoned veteran at a Baltimore fire station. However, Jack has reached a crossroads as the sacrifices he's made have put him in harm's way innumerable times and significantly impacted his relationship with his wife and kids. | 13.050 | 2004-10-01 | Ladder 49 | 6.4 | 707 | 60000000 | [{'id': 919, 'logo_path': None, 'name': 'Beacon Communications', 'origin_country': ''}, {'id': 9195, 'logo_path': '/ou5BUbtulr6tIt699q6xJiEQTR9.png', 'name': 'Touchstone Pictures', 'origin_country': 'US'}, {'id': 10157, 'logo_path': None, 'name': 'Beacon Pictures', 'origin_country': ''}, {'id': 877, 'logo_path': None, 'name': 'Casey Silver Productions', 'origin_country': 'US'}, {'id': 53987, 'logo_path': None, 'name': 'Fantail Films Inc.', 'origin_country': ''}] | [{'iso_3166_1': 'US', 'name': 'United States of America'}] | [{'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}] | Their greatest challenge lies in rescuing one of their own | 115 | 74541707 |
| 9997 | [18, 35] | 484482 | fr | Le Grand Bain | 40-year-old Bertrand has been suffering from depression for the last two years and is barely able to keep his head above water. Despite the medication he gulps down all day, every day, and his wife's encouragement, he is unable to find any meaning in his life. Curiously, he will end up finding this sense of purpose at the swimming pool, by joining an all-male synchronised swimming team. | 13.049 | 2018-10-24 | Sink or Swim | 6.9 | 1398 | 0 | [{'id': 34780, 'logo_path': '/ahgzFRXKF2fkOV0WfB0RCxEmxki.png', 'name': 'Chi-Fou-Mi Productions', 'origin_country': 'FR'}, {'id': 2612, 'logo_path': '/3ulBLchjjnjVmvyLQSwO62MDKLE.png', 'name': 'Les Productions du Trésor', 'origin_country': 'FR'}] | [{'iso_3166_1': 'BE', 'name': 'Belgium'}, {'iso_3166_1': 'FR', 'name': 'France'}] | [{'english_name': 'French', 'iso_639_1': 'fr', 'name': 'Français'}] | NaN | 122 | 0 |
| 9998 | [18] | 453755 | en | Arctic | A man stranded in the Arctic is finally about to receive his long awaited rescue. However, after a tragic accident, his opportunity is lost and he must then decide whether to remain in the relative safety of his camp or embark on a deadly trek through the unknown for potential salvation. | 13.049 | 2018-11-21 | Arctic | 6.5 | 1095 | 2000000 | [{'id': 35849, 'logo_path': '/bHYIJoy2ri7crfHugwR0AdF3qdM.png', 'name': 'Armory Films', 'origin_country': 'US'}, {'id': 12496, 'logo_path': '/tk6stgFevcJ2iWyd0IaEfVStNqL.png', 'name': 'Pegasus Pictures', 'origin_country': 'IS'}, {'id': 21775, 'logo_path': None, 'name': 'Union Entertainment Group', 'origin_country': ''}, {'id': 120207, 'logo_path': None, 'name': 'The Domain Group', 'origin_country': ''}, {'id': 12142, 'logo_path': '/rPnEeMwxjI6rYMGqkWqIWwIJXxi.png', 'name': 'XYZ Films', 'origin_country': 'US'}] | [{'iso_3166_1': 'IS', 'name': 'Iceland'}, {'iso_3166_1': 'US', 'name': 'United States of America'}] | [{'english_name': 'Danish', 'iso_639_1': 'da', 'name': 'Dansk'}, {'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}] | Survival is the only option | 98 | 4100000 |
| 9999 | [10402, 99, 10751] | 54518 | en | Justin Bieber: Never Say Never | Tells the story of Justin Bieber, the kid from Canada with the hair, the smile and the voice: It chronicles his unprecedented rise to fame, all the way from busking in the streets of Stratford, Canada to putting videos on YouTube to selling out Madison Square Garden in New York as the headline act during the My World Tour from 2010. It features Usher, Scooter Braun, Ludacris, Sean Kingston, Antonio "L.A." Reid, Boyz II Men, Miley Cyrus, Jaden Smith, Justin's family members and parts of his crew and huge fanbase in a mix of interviews and guest performances. It was released in 3D in theaters all around the world and is the highest grossing concert movie of all time, beating the previous record held by Michael Jackson's This Is It from 2009. | 13.049 | 2011-02-11 | Justin Bieber: Never Say Never | 5.2 | 378 | 13000000 | [{'id': 7377, 'logo_path': '/zdVYfWyiQmLgyq8V73HRHpCWjRO.png', 'name': 'Insurge Pictures', 'origin_country': 'US'}, {'id': 7378, 'logo_path': None, 'name': 'Magical Elves', 'origin_country': 'US'}, {'id': 7379, 'logo_path': '/gwpgBGRP4nXn3M6gXUV6W02uKGS.png', 'name': 'Scooter Braun Films', 'origin_country': 'US'}, {'id': 4, 'logo_path': '/gz66EfNoYPqHTYI4q9UEN4CbHRc.png', 'name': 'Paramount', 'origin_country': 'US'}, {'id': 162509, 'logo_path': None, 'name': 'L.A. Reid Media', 'origin_country': ''}, {'id': 87041, 'logo_path': None, 'name': 'AEG Live', 'origin_country': ''}, {'id': 746, 'logo_path': '/kc7bdIVTBkJYy9aDK1QDDTAL463.png', 'name': 'MTV Films', 'origin_country': 'US'}, {'id': 78156, 'logo_path': None, 'name': 'The Island Def Jam Music Group', 'origin_country': ''}] | [{'iso_3166_1': 'US', 'name': 'United States of America'}] | [{'english_name': 'English', 'iso_639_1': 'en', 'name': 'English'}] | Find out what's possible if you never give up. | 105 | 98500000 |